
OpenAI’s new Voice Engine turns text into lifelike speech with just a brief audio clip, offering realistic voices for AI applications while sparking debates on the technology's ethical implications.
OpenAI has been making waves with its AI models, but its latest venture into synthetic voices is particularly intriguing. In a recent small-scale preview, the company introduced Voice Engine, a model that generates natural-sounding speech from text input and a single 15-second audio sample. The technology produces realistic, emotive voices, and it also raises important questions about safety and ethical deployment.
Voice Engine was first developed in late 2022 and has since been used to power the preset voices in OpenAI's text-to-speech API, as well as ChatGPT Voice and Read Aloud. Here are the key technical details:
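For readers who want to hear the preset voices mentioned above, the following is a minimal sketch of a call to OpenAI's public speech endpoint. It is not Voice Engine's custom-voice workflow, which is not publicly available; the public endpoint only accepts preset voice names. The helper names `build_speech_request` and `synthesize` are hypothetical, and the sketch assumes an API key is available in the `OPENAI_API_KEY` environment variable.

```python
import json
import os
import urllib.request

# Public speech endpoint; it serves preset voices only, not custom voice samples.
SPEECH_URL = "https://api.openai.com/v1/audio/speech"


def build_speech_request(text, voice="alloy", model="tts-1", api_key=None):
    """Build (but do not send) a POST request for the speech endpoint."""
    # Assumption: the key comes from OPENAI_API_KEY unless passed explicitly.
    key = api_key or os.environ.get("OPENAI_API_KEY", "")
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        SPEECH_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, out_path="speech.mp3", **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as handle:
        handle.write(audio)
    return out_path
```

Calling `synthesize("Hello from a preset voice.")` with a valid key should write an audio file to `speech.mp3`. Splitting request construction from sending keeps the payload easy to inspect without making a network call.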
To explore the potential of Voice Engine, OpenAI started private testing with a select group of trusted partners. These early applications have shown promising results:

While the potential applications of synthetic voices are vast, OpenAI is taking a cautious approach due to the risks of misuse. Here’s what they’re doing:
Based on the results of these small-scale tests and ongoing conversations, OpenAI will make an informed decision about whether and how to deploy the technology more broadly. The goal is to balance innovation with safety, so that synthetic voices are put to beneficial use across industries.
Voice Engine represents a significant step forward in text-to-speech technology, offering new possibilities for education, content creation, and more. However, the responsible development and deployment of this technology remain crucial. OpenAI's cautious approach is commendable, as it aims to realize the benefits of synthetic voices while minimizing potential harms.
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
2 April 2024