OpenAI Revolutionizes API with Advanced Voice Features
OpenAI recently unveiled a set of voice intelligence features for its API, intended to change how developers build interactive applications. With them, applications can speak with users and transcribe and translate conversations in real time.
The voice model GPT-Realtime-2 stands out for its ability to simulate realistic voice conversations. Built on the reasoning framework of GPT-5, this model is designed to handle more complex user queries than its predecessor, GPT-Realtime-1.5.
Among the new features, GPT-Realtime-Translate distinguishes itself with its real-time translation services. This model is capable of understanding over 70 input languages and providing translations in 13 output languages, all in a smooth and conversational manner.
Additionally, OpenAI has introduced GPT-Realtime-Whisper, a live transcription feature that converts speech to text as interactions occur.
According to OpenAI, the goal is to move real-time audio beyond simple question-and-answer exchanges toward voice interfaces that can listen, reason, translate, transcribe, and act during a conversation.
Businesses, particularly those focused on customer service, are expected to benefit from these updates. However, OpenAI emphasizes that these tools can also be useful in various fields such as education, media, events, and creative platforms.
Aware of the potential for misuse, OpenAI has implemented safeguards against spam, fraud, and other online abuse. Built-in triggers can interrupt conversations that violate its guidelines on harmful content.
All these new voice models are available in OpenAI's Realtime API. The Translate and Whisper services are billed by the minute, while GPT-Realtime-2 is priced based on token consumption.
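To make the difference between the two billing models concrete, here is a minimal Python sketch of the arithmetic. The article gives no prices, so every rate and token count below is a hypothetical placeholder, and the class names `PerMinuteRate` and `PerTokenRate` are illustrative only; none of this reflects OpenAI's actual pricing or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PerMinuteRate:
    """Flat rate per minute of audio (hypothetical figure, not an OpenAI price)."""
    usd_per_minute: float

    def cost(self, audio_seconds: float) -> float:
        # Cost scales only with the duration of the audio.
        return self.usd_per_minute * audio_seconds / 60.0


@dataclass(frozen=True)
class PerTokenRate:
    """Input/output rates per million tokens (hypothetical figures)."""
    usd_per_million_input: float
    usd_per_million_output: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        # Cost scales with how many tokens the conversation consumes.
        total = (input_tokens * self.usd_per_million_input
                 + output_tokens * self.usd_per_million_output)
        return total / 1_000_000


# Placeholder rates chosen only to keep the arithmetic easy to follow.
transcription = PerMinuteRate(usd_per_minute=0.01)
voice_agent = PerTokenRate(usd_per_million_input=10.0,
                           usd_per_million_output=20.0)

# A 5-minute call (300 seconds) transcribed live, billed by the minute.
transcription_cost = transcription.cost(audio_seconds=300)

# A conversation assumed to consume 20,000 input and 5,000 output tokens.
agent_cost = voice_agent.cost(input_tokens=20_000, output_tokens=5_000)

print(f"per-minute: ${transcription_cost:.2f}")
print(f"per-token:  ${agent_cost:.2f}")
```

Under these assumed figures, the per-minute cost depends only on call length, while the per-token cost depends on how much is said and generated, so two calls of the same duration can cost different amounts on the token-priced model.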
Brief IA: AI news in French
The essentials of artificial intelligence news, decoded and explained every day.