OpenAI brings video to ChatGPT Advanced Voice Mode

AVM finally gets vision capabilities
By Cecily Mauran
ChatGPT now has vision capabilities for Advanced Voice Mode. Credit: Jaque Silva / NurPhoto / Getty Images

ChatGPT's Advanced Voice Mode now has video and screenshare capabilities.

The feature was announced last May with the release of GPT-4o, but only the audio modality had been live until now. Users can now chat with ChatGPT through a phone camera, and the model will "see" what they see.

In the livestream, CPO Kevin Weil and other OpenAI team members demoed ChatGPT helping them make pour-over coffee. With the camera pointed at the action, AVM demonstrated that it understood how the coffee maker works and walked the team through brewing their beverage. The team also showed how ChatGPT supports screensharing by understanding an open message on a phone with Weil wearing a Santa beard.


The long-awaited announcement comes a day after Google unveiled the next generation of its flagship model, Gemini 2.0. Gemini 2.0 can also process visual and audio inputs and has more agentic capabilities, meaning it can perform multi-step tasks on the user's behalf. Its agent features currently exist as research prototypes under three different names: Project Astra for a universal AI assistant, Project Mariner for specific AI tasks, and Project Jules for developers.

Not to be outdone, OpenAI's demo showcased how ChatGPT's vision modality accurately identified objects — and was even interruptible. And yes, part of this included a Santa voice option in Voice Mode, complete with a deep, jolly voice and lots of "ho-ho-hos." You can chat with OpenAI's version of Santa by tapping the snowflake icon in ChatGPT. No word yet on whether the real Santa Claus contributed his voice for AI training or OpenAI used his voice without prior consent.

Oddly, when selecting the Santa voice in the ChatGPT app, the user is warned that the voice is only for people 13 and older.

Starting today, video and screenshare are available to ChatGPT Plus and Pro users, with Enterprise and Edu availability coming in January.

Cecily Mauran
Tech Reporter

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on X at @cecily_mauran.
