OpenAI announces GPT-4o, a multimodal voice assistant that's free for all ChatGPT users

Sam Altman teased the announcement last week as rumors swirled.
 By 
Cecily Mauran
 on 
OpenAI employees showing off new GPT-4o capabilities in a live demo
OpenAI's new GPT-4o model can help with math homework and a whole lot more. Credit: OpenAI

OpenAI has unveiled GPT-4o, a new AI model that combines text, vision, and audio.

At its highly anticipated livestream event, OpenAI CTO Mira Murati shared that GPT-4o can process text, audio, and vision in one model. GPT-4o will be available for free to all ChatGPT users. It is also available in the API, and is half the price and twice as fast as GPT -4 Turbo. The "o" in the name stands for "omni," referencing its combined modalities in one model.

GPT-4o voice capabilities

The announcement confirmed previous rumors of a voice assistant. Previously, there were separate models for voice and image modalities. But GPT-4o is "natively multimodal," said OpenAI CEO Sam Altman on X.


You May Also Like

Now, the GPT-4o brings the modalities together, decreasing lag and making it responsive in real-time. That means you can interrupt the model. It can also sense emotions and tones and express its own emotions and tones, making it sound extremely dramatic or robotic. It can even sing (if you want it to).

The soothing female voice used in the demo also sounds a lot like Scarlett Johansson's voice assistant character in the film Her.

GPT-4o vision capabilities

Another demo showcased GPT-4o's ability to help with math problems using its vision modality. It can walk the user through a basic math problem when solving for X. By highlighting code on the screen, ChaGPT with GPT-4o can process and understand what the code is and help improve it.

From user inquiries, ChatGPT with GPT-4o showed off its ability to translate in real-time and understand emotions.

Murati started the event by sharing the availability of a new desktop app.

Previously, OpenAI was rumored to announce a ChatGPT search engine or a new transformer model GPT-5 ahead of Google I/O. CEO Sam Altman shot down those rumors ahead of Monday's event, but they are still believed to be in development.

Topics ChatGPT OpenAI

Mashable Image
Cecily Mauran
Tech Reporter

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on X at @cecily_mauran.

Mashable Potato

Recommended For You
ChatGPT GPT-4o users are raging at OpenAI on Reddit right now
ChatGPT GPT-4o

GPT 5.4 arrives on ChatGPT: 5 improvements to know
open ai logo through magnifying glass

OpenAI is retiring GPT-4o, and the AI relationships community is heartbroken
illustration of chatgpt chat with the text 'i am not your husband'

OpenAI releases GPT-5.3-Codex, a coding model that helped build itself
chatgpt app logo on phone screen with same logo as background

OpenAI to finally bring ads to ChatGPT
Photo illustration of the chatgpt logo on a smartphone. The same logo can be seen faded in the background

Trending on Mashable
NYT Connections hints today: Clues, answers for April 3, 2026
Connections game on a smartphone

Wordle today: Answer, hints for April 3, 2026
Wordle game on a smartphone

What's new to streaming this week? (April 3, 2026)
A composite of images from film and TV streaming this week.

Google launches Gemma 4, a new open-source model: How to try it
Google Gemma

The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!