
Share
PersonaPlex revolutionizes real-time conversations with its dual-stream configuration, enabling seamless full-duplex interactions that mimic natural human dialogue, complete with interruptions and rapid turn-taking.
NVIDIA has unveiled PersonaPlex, a groundbreaking real-time speech-to-speech conversational model that excels in full-duplex interactions. This model is designed to handle continuous audio streams, enabling natural conversational dynamics like interruptions and rapid turn-taking. Here’s what changed technically and why it matters for practitioners:
Dual-Stream Configuration: PersonaPlex operates on a dual-stream setup where listening and speaking occur simultaneously. This allows the model to update its internal state based on ongoing user speech while producing fluent output audio.
Neural Codec Integration: The model uses a neural codec to encode continuous audio streams. This codec compresses the audio into a sequence of tokens that the model can process efficiently.
Conditional Prompts: Before the conversation begins, PersonaPlex is conditioned on two types of prompts:

Model Architecture:
Benchmarks:
For more information on NVIDIA’s latest open models and developer tools, including Nemotron and Riva Speech, visit the NVIDIA Developer Portal at developer.nvidia.com.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
16 February 2026
88 articles
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
Related Articles

OpenEvidence Targets Hospitals to Expand Its AI Chatbot for Doctors
Products & Applications · 3 min

OpenEvidence Launches Voice AI to Enhance Physician Workflow
Products & Applications · 3 min

Doximity Accelerates AI Investment in 2026, Targeting Multibillion-Dollar Market
Products & Applications · 3 min
More Stories