
Share
ElevenLabs unveils Conversational AI 2.0, boasting a state-of-the-art turn-taking model and multilingual capabilities to enhance natural interactions and security in enterprise settings like customer support and marketing.
ElevenLabs, the voice and AI sound effects startup founded by former Palantir engineers, has just unveiled Conversational AI 2.0-a significant upgrade to its platform for building advanced voice agents. This new version introduces several key features aimed at enhancing natural conversations, ensuring secure interactions, and supporting multilingual environments. These improvements make it particularly well-suited for enterprise applications like customer support, call centers, and outbound sales and marketing.
One of the most notable upgrades in Conversational AI 2.0 is its advanced turn-taking model. This technology addresses a common issue in traditional voice systems: awkward pauses or interruptions that can disrupt the flow of conversation. Here’s how it works:
Conversational AI 2.0 also introduces integrated language detection, which simplifies multilingual interactions:

In addition to these user-facing improvements, ElevenLabs has also focused on enhancing security and enterprise readiness:
The launch of Conversational AI 2.0 comes just four months after the debut of the original platform, showcasing ElevenLabs' commitment to rapid development. This quick turnaround is also a response to competition from other startups like Hume, which recently launched its own turn-based voice AI model, EVI 3.
Despite some early skepticism and declarations that ElevenLabs was "dead" due to the rise of open-source AI voice models, Jozef Marko from ElevenLabs' engineering team is confident in the platform's capabilities. According to Marko, Conversational AI 2.0 sets a new standard for voice-driven experiences, demonstrating that proprietary solutions can still offer significant advantages over open-source alternatives.
ElevenLabs' Conversational AI 2.0 represents a substantial leap forward in voice agent technology. With its advanced turn-taking model, multilingual support, and enterprise-grade features, it is well-positioned to meet the evolving needs of businesses looking to enhance their customer interactions through natural, intelligent, and secure voice experiences.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
2 June 2025
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories