
The agentic Realtime model is a native speech-to-speech model that can be used to build customer service agents, phone reps and voice navigation features. Instead of chaining speech-to-text and text-to-speech steps, it generates audio «directly through a single model and API.» OpenAI is marketing it to developers who want more natural-sounding speech, and it’s not available as a distinct model in ChatGPT, at least not yet. You can already hear and see it in use at places like Zillow, T-Mobile, StubHub and Oscar Health, though. With general availability, it will surely show up in a lot more places soon.
More at: OpenAI’s launch page, discussion on r/OpenAI.
Read on for more news!





Elon Musk just announced on x.com that the model 






