Google's Live Translate Now Speaks 70+ Languages
Gemini 3.5 Live Translate does near real-time voice-to-voice translation in 70+ languages, keeping your tone and pacing. It's coming to the Translate app.
Evgeny Arsentyev · PhDI travel with a phone full of half-useful translation screenshots, so this one landed for me personally. Google just unveiled Gemini 3.5 Live Translate, an audio model that does near real-time, voice-to-voice translation across more than 70 languages. It detects the language automatically, and instead of waiting politely for someone to finish a sentence, it translates continuously — staying just a few seconds behind the speaker, even in a noisy room.
The detail that matters most to me isn't the language count, it's how the output sounds. The model preserves the speaker's intonation, pacing and pitch, so a translated voice carries the same rhythm as the original instead of the flat robot monotone we've all suffered through. Google describes the core trick as generating speech continuously while "balancing the trade-off between waiting for context to improve quality and translating immediately to stay in sync." That tension — accuracy versus lag — is the whole game in live translation, and they're betting on staying in sync.
Where you'll actually get it
This isn't a research demo locked in a lab. For regular people, it's rolling out in the Google Translate app on Android and iOS, globally. Android gets a new "listening mode" that pipes the translation straight through your phone's earpiece, so you can hear a conversation translated in your ear as it happens. In Google Meet, the upgrade jumps from a measly five supported languages and English-only output to over 2,000 language combinations — that's the difference between a token feature and something a multinational team can lean on. Developers get it now through the Gemini Live API and Google AI Studio; enterprises get a private preview in Meet this month.
Why does this matter if you're not a developer or a diplomat? Because the language barrier is one of the last genuinely hard walls in everyday life — talking to a doctor abroad, a landlord, your partner's relatives, a supplier in another country. Text translation has been good enough for a while; live spoken translation that keeps a human cadence is a different category. It turns a stilted, exhausting exchange into something closer to an actual conversation.
One quiet but important touch: every audio output is watermarked with SynthID, Google's imperceptible marker for AI-generated content. In a year where AI-faked voices are a real scam vector, baking provenance into the audio from the start is the responsible default — and worth knowing exists.
Update the Google Translate app and try listening mode on your next call or trip with someone who doesn't share your language — even just ordering food or talking to a relative. Treat it as a confidence tool, not a contract tool: brilliant for everyday conversation, but for anything legal, medical or money-related, still confirm the important bits with a human. And remember the output is SynthID-watermarked, so a clip can be flagged as AI later.
I don't think this kills language learning — context, jokes and nuance still reward knowing a language yourself. But for the millions of small moments where you just need to be understood right now, the wall is getting a lot shorter.

Author
Evgeny Arsentyev
PhD · Chief Product Officer at a healthtech company
Want to actually build this?
Guides explain. The free course transforms — personalized, gamified, and built to get you shipping fast.
◉ Start the free courseSource: deepmind.google