
Share
Google Meet's new real-time speech translation uses DeepMind’s advanced audio models to break down language barriers in virtual meetings, fostering global collaboration and inclusivity like never before.
Google announced at Google I/O 2025 that it’s bringing real-time speech translation to Google Meet. This new feature leverages a large language audio model from Google DeepMind, enabling natural and free-flowing conversations between participants speaking different languages. The integration of this technology into Google Meet is a significant step forward in making virtual meetings more accessible and inclusive for users worldwide.
The core technical advancement here is the deployment of a sophisticated speech-to-speech translation model. This model, developed by DeepMind, can process spoken words in real-time and translate them into the listener’s preferred language with minimal latency. Here are the key points:
To achieve this level of real-time translation, several technical components work in tandem:

Google has not released specific benchmarks for the new feature, but they have emphasized its performance in real-world scenarios. Early tests indicate:
For software engineers and developers, this update highlights several important trends and considerations:
Google Meet's new real-time speech translation feature is a significant leap forward in multilingual communication. By leveraging advanced DeepMind models, Google has created a tool that not only translates words but also preserves the essence of human interaction. This update sets a new standard for virtual meeting platforms and paves the way for more inclusive and accessible communication technologies.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
21 May 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories