Model Release

DeepMind Launches Gemini 3.5 Live Translate for Real-Time Voice Translation

DeepMind's Gemini 3.5 Live Translate supports translation in over 70 languages, rolling out today across Google products.

Image: Google DeepMind

DeepMind has released Gemini 3.5 Live Translate, an audio model that enables near real-time speech-to-speech translation across 70 languages. The model automatically detects languages and generates smooth, natural-sounding translations that preserve the speaker's intonation, pacing, and pitch. It is designed to provide fluid audio without awkward pauses, staying just a few seconds behind the speaker during the session. This advancement aims to improve communication across languages in real-time settings. Source: deepmind

Gemini 3.5 Live Translate is being rolled out starting today across Google products, including Google Meet for enterprises in private preview and Google Translate on Android and iOS for all users. The model processes speech as it is streamed, enabling seamless multilingual communication without manual configuration. It also features noise robustness, allowing it to function effectively in loud or unpredictable environments. Developers can integrate the model via the Gemini Live API, which supports platforms like Agora, Fishjam, LiveKit, Pipecat, and Vision Agents. These integrations handle real-time media streaming, allowing developers to focus on user experience. Source: deepmind

DeepMind's initiative builds on its 20-year history of machine learning experiments in translation, which have translated over a trillion words for billions of users monthly. The company emphasized that Gemini 3.5 Live Translate represents a next step in this evolution, offering improvements such as support for over 70 languages and enabling conversations across over 2000 language combinations in one meeting. Source: deepmind

Key points

Gemini 3.5 Live Translate supports translation in over 70 languages.
The model automatically detects 70+ languages and generates smooth, natural-sounding translated speech.
Gemini 3.5 Live Translate processes speech as it’s streamed, enabling a more seamless connection across languages.
The model is rolling out starting today across Google products, including Google Meet for enterprises in private preview and Google Translate on Android and iOS for all users.
Gemini 3.5 Live Translate is being integrated with platforms like Agora, Fishjam, LiveKit, Pipecat, and Vision Agents via the Gemini Live API.
DeepMind's translation efforts began as one of its pioneering machine learning experiments, translating over a trillion words for billions of users monthly.

Source: Google DeepMind Read the original →

WRITTEN BY

Alex Lindgren

LLMs & Frontier Models

Alex covers the large language models and their impact on society.

DeepMind Launches Gemini 3.5 Live Translate for Real-Time Voice Translation

Key points

Related articles

Anthropic's Claude Opus 5 Costs Less Than Fable 5 While Matching Performance

Anthropic Releases Opus 5 Focused on Token Efficiency

Moonshot AI's Kimi K3 Sparks US-China AI Race

Kimi K3 Sparks AI Panic Amid U.S. Industry Reactions