Open AI and Google trained AI models on YouTube videos

The two tech giants transcribed YouTube videos, which may violate creator copyrights.
 By 
Elena Cavender
 on 
A phone screen displaying the YouTube logo mirrored.
Tech companies are desperate to harvest as much data as possible to train their AI models. Credit: SOPA Images / Contributor / Lightrocket via Getty Images

Both OpenAI and Google turned to transcribing YouTube videos to further train their AI models, which may violate creators' copyrights, the New York Times reports. The report details how the two tech giants, along with Meta, cut corners to access as much data as possible to train their AI models.

According to the report, OpenAI used Whisper, a speech recognition tool, to transcribe more than one million hours of YouTube videos. It then fed the transcripts into GPT-4, the powerful AI system that the latest model of ChatGPT's chatbot runs on. Google, which owns YouTube, also transcribed YouTube videos to train its AI models.

The transcription of videos by both companies may infringe on creator's copyrights to their videos. Other uses of creator content to train AI has prompted copyright and licensing lawsuits.


You May Also Like

OpenAI's use of YouTube videos also may violate Google's rules, which prohibits the use of its videos for "independent" applications and "automated means (such as robots, botnets or scrapers)" of accessing its videos.

Matt Bryant, a spokesperson for Google, told the New York Times that the company was unaware of any such use by OpenAI. But the report alleges that people at Google knew about OpenAI's unauthorized use of YouTube videos and neglected to take action because it was doing the same thing. Google also told the paper that it only trains its AI on videos from creators who have agreed for their content to be used in this manner.

In July 2023, Google changed its terms of service to allow the use public online material like Google Docs and Google Maps restaurant reviews to further train its AI models.

Mashable Image
Elena Cavender

Elena is a tech reporter and the resident Gen Z expert at Mashable. She covers TikTok and digital trends. She recently graduated from UC Berkeley with a BA in American History. Email her at [email protected] or follow her @ecaviar_.

Mashable Potato

Recommended For You
Google launches Gemma 4, a new open-source model: How to try it
Google Gemma

'Heated Rivalry' star Connor Storrie embraces childhood YouTube videos as 'self-acceptance'
Connor Storrie announces SAG Awards nominations in Los Angeles

The best Apple Watch deals to shop during Amazon's Big Spring Sale — save on Series 11 and SE 3 models
Two Apple Watches against a colorful background.

Google Veo 3.1 will generate social-ready vertical videos in Gemini
google gemini and veo 3.1 logos

How to watch the 2026 Australian Open online for free
Spain's Carlos Alcaraz hits a return

Trending on Mashable
NYT Connections hints today: Clues, answers for April 3, 2026
Connections game on a smartphone

Wordle today: Answer, hints for April 3, 2026
Wordle game on a smartphone

NYT Connections hints today: Clues, answers for April 2, 2026
Connections game on a smartphone


You can track Artemis II in real time as Orion flies to the moon
Victor Glover and Reid Wiseman piloting the Orion spacecraft
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!