
Share
DeepMind unveils breakthroughs in audio generation, crafting lifelike multi-speaker dialogues that push the boundaries of natural speech synthesis, paving the way for more immersive AI interactions.
Zalán Borsos, Matt Sharifi, and Marco Tagliasacchi | October 30, 2024
DeepMind has been at the forefront of speech generation technology, developing models that can produce high-quality, natural-sounding audio from various inputs. This work is crucial for creating more engaging and intuitive digital assistants and AI tools. Over the past few years, we've made significant strides in this area, particularly with the development of models capable of generating long-form, multi-speaker dialogue.
Single-Speaker Audio: Our technology powers single-speaker audio in several Google products, including:
Multi-Speaker Dialogue:
In our previous work on SoundStorm, we demonstrated the ability to generate 30-second segments of natural dialogue between multiple speakers. This was a significant step forward, as it extended our earlier research on:

Neural Audio Codecs:
Text-to-Audio Models:
NotebookLM Audio Overviews:
Illuminate:
We are committed to continuing our research in audio generation, with a focus on improving naturalness, coherence, and efficiency. Our goal is to make digital assistants and AI tools more intuitive and engaging for everyone.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
31 October 2024
88 articles
Related Articles
Related Articles
More Stories