OpenAI Launches Three Real-Time Audio Models for Voice Agents

Written by

OpenAI unveiled three new audio models in 2026 aimed at developers building voice-based applications, with a focus on real-time task completion and more interactive voice agents.

The three models each serve a distinct function. GPT-Realtime-2 is designed to handle complex requests and manage interruptions during live conversations. GPT-Realtime-Translate provides live translation across multiple languages. GPT-Realtime-Whisper delivers instant speech-to-text conversion, suited for use cases such as live captions and note-taking.

Companies already testing the tools include Zillow and Priceline, signaling early enterprise interest in deploying the models for real-world applications.

The release reflects OpenAI’s stated goal of making voice agents more capable of completing tasks in real time, which may expand how businesses integrate spoken-language interfaces into their products and services.

Source: Tech-Economic Times

This article was generated by AI and cites original sources.

OpenAI Launches Three Real-Time Audio Models for Voice Agents

More posts

Jury Rejects Elon Musk’s Lawsuit Against OpenAI

Baidu Beats Quarterly Estimates as AI-Driven Cloud Growth Offsets Advertising Decline

Pope Leo XIV to Launch AI Encyclical on May 25 with Anthropic Co-Founder Present

Infosys Cuts Average Employee Bonus to 70% in Q4 FY2026, Down 15 Percentage Points