Google I/O: Google announces new safety framework for responsible AI

Google's red team takes on misinformation and cyber security.
 By 
Chase DiBenedetto
 on 
A man stands in front of a large screen that reads "Responsible AI: Human insight and safety testing."
Google unveils AI-assisted red teaming to build safer genAI. Credit: Google

As Google positions its upgraded generative AI as teacher, assistant, and recommendation guru, the company is also trying to turn its models into a bad actor's worst enemy.

"It's clear that AI is already helping people," said James Manyika, Google's senior vice president of research, technology, and society, to the crowd at the company's Google I/O 2024 conference. "Yet, as with any emerging technology, there are still risks, and new questions will arise as AI advances and its uses evolve."

Manyika then announced the company's latest evolution of red teaming, an industry standard testing process to find vulnerabilities in generative AI. Google's new "AI-assisted red teaming" trains multiple AI agents to compete with each other to find potential threats. These trained models can then more accurately pinpoint what Google calls "adversarial prompting" and limit problematic outputs.

The process is the company's new plan for building a more responsible, humanlike AI, but its also being sold as a way to address growing concerns about cyber security and misinformation.

The new safety measures incorporate feedback from a team of experts across tech, academia, and civil society, Google explained, as well as its seven principles of AI development: Being socially beneficial, avoiding bias, building and testing for safety, human accountability, privacy design, upholding scientific excellence, and public accessibility. Through these new testing efforts, and industry-wide commitments, Google's attempting to putting product where its words are.

Topics Google

Chase sits in front of a green framed window, wearing a cheetah print shirt and looking to her right. On the window's glass pane reads "Ricas's Tostadas" in red lettering.
Chase DiBenedetto
Social Good Reporter

Chase joined Mashable's Social Good team in 2020, covering online stories about digital activism, climate justice, accessibility, and media representation. Her work also captures how these conversations manifest in politics, popular culture, and fandom. Sometimes she's very funny.

Mashable Potato

Recommended For You

Seth Meyers unpacks Trump's detail-free 'framework of a deal' on Greenland
Seth Meyers presents "Late Night" beside an image of Donald Trump

Ring and Flock Safety cancel partnership amidst surveillance criticism
A Ring Outdoor Cam Pro camera during a media preview at Amazon's headquarters in Seattle, Washington, US, on Wednesday, Sept. 24, 2025.

Meta loses major child safety trial, ordered to pay $375 million
Meta CEO Mark Zuckerberg appears outside of LA courthouse after testifying in social media addiction trial.

Child safety group blasts YouTube for million dollar gamble on AI content for kids
A blurry YouTube app icon over a large glowing YouTube play logo.

Trending on Mashable
NYT Connections hints today: Clues, answers for April 3, 2026
Connections game on a smartphone

Wordle today: Answer, hints for April 3, 2026
Wordle game on a smartphone

Google launches Gemma 4, a new open-source model: How to try it
Google Gemma

NYT Strands hints, answers for April 3, 2026
A game being played on a smartphone.

Wordle today: Answer, hints for April 4, 2026
Wordle game on a smartphone
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!