VoiceFrom delivers better translation accuracy than OpenAI's new GPT-Realtime-Translate
OpenAI shipped GPT-Realtime-Translate on May 8. We ran it through our head-to-head benchmark. VoiceFrom Pro leads on translation accuracy by a wide margin.
On May 8, OpenAI added GPT-Realtime-Translate to its Realtime API, the first OpenAI model purpose-built for live speech translation. The launch reshaped how the industry talks about live translation. It also prompted a question event producers have been asking us all week: how does the new OpenAI model compare to the live translation systems that already serve real events?
We ran it through the same head-to-head evaluation harness we use on our own product and three of the most-cited competitors: Google Meet, LiveVoice, and Palabra. The short version:
- VoiceFrom Pro is substantially more accurate than OpenAI’s new model on the industry-standard WMT-grade quality metric, on average across eight language pairs.
- OpenAI’s median latency is faster than VoiceFrom’s.
- Full data, scoring methodology, charts, and audio comparisons are in the engineering writeup below.
Full benchmark: Five platforms, one harness: a head-to-head live translation benchmark.
If accuracy is what your audience needs, VoiceFrom is the system to put on stage. Schedule a call and we will set up a pilot at your next event.