New version of Gemini beats other AIs at math, science, and reasoning

Don't try to beat it at chess.
 By 
Stan Schroeder
 on 
Gogle Gemini
Google's new LLM is the king of the hill when it comes to nearly everything. Credit: Sopa Images / Getty Images

Google's new Gemini Pro is smarter than other AIs at reasoning, science, and coding.

This is according to a series of benchmark results posted by Google on Thursday. In short, Gemini 2.5 Pro beats chief competitors at nearly everything — though we're sure the companies behind those competitors would disagree.

According to Google's data, Gemini 2.5 Pro has a healthy lead over OpenAI o3, Claude Opus 4, Grok 3 Beta, and DeepSeek R1, in the Humanity's Last Exam benchmark, which evaluates a model's math, science, knowledge, and reasoning. It's also better at code editing (per the Aider Polyglot benchmark), and it wins over all competitors in several factuality benchmarks including FACTS Grounding, meaning it's less likely to provide factually inaccurate text.


You May Also Like

The only benchmark in which Gemini 2.5 Pro isn't a clear winner is the mathematics-focused AIME 2025, and even there the differences between results are pretty small.

As a result of all the improvements in Gemini 2.5 Pro, this model is now on top of the LMArena leaderboard with a score of 1470.

There's a catch, though: The final version of Gemini 2.5 Pro isn't widely available yet. Google calls this latest version an "upgraded preview," with a stable version coming "in a couple of weeks." The preview should now be available in the Gemini app, though.

Stan Schroeder
Stan Schroeder
Senior Editor

Stan is a Senior Editor at Mashable, where he has worked since 2007. He's got more battery-powered gadgets and band t-shirts than you. He writes about the next groundbreaking thing. Typically, this is a phone, a coin, or a car. His ultimate goal is to know something about everything.

Mashable Potato

Recommended For You
Google releases Gemini 3.1 Pro: Benchmark performance, how to try it
gemini 3.1 pro banner image from google

ChatGPT can now generate visuals for math and science lessons
A screenshot of a ChatGPT chat. The user asks "explain the pythagorean theorem." ChatGPT generates a side by side visual, with the formula on the left and a visual of a triangle on the right.

Google hit with shocking wrongful death lawsuit over Gemini AI chatbot
Google Gemini logo

Google Chrome unveils Gemini-powered auto-browsing feature
Chrome auto browse

Gemini will let you import ChatGPT, other chatbot conversations
A phone screen shows the blue Gemini logo.

Trending on Mashable
NYT Connections hints today: Clues, answers for April 3, 2026
Connections game on a smartphone

Wordle today: Answer, hints for April 3, 2026
Wordle game on a smartphone


The Earth is glowing in new Artemis II pictures of home
One half of the Earth is seen floating in space through the open door of the Orion spacecraft.

NYT Strands hints, answers for April 3, 2026
A game being played on a smartphone.
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!