Tuesday, July 22, 2025

OpenAI and Google outdo the mathletes, however not one another

AI fashions from OpenAI and Google DeepMind achieved gold-medal scores within the 2025 Worldwide Math Olympiad (IMO), one of many world’s oldest and most difficult excessive school-level math competitions, the businesses independently introduced in latest days.

The outcomes underscore simply how briskly AI methods are advancing, and but, how evenly matched Google and OpenAI appear to be within the AI race. AI firms are competing fiercely for the general public notion of being forward within the AI race: an intangible battle of “vibes” that may have huge implications for securing high AI expertise. Lots of AI researchers come from backgrounds in aggressive math, so benchmarks like IMO imply greater than others.

Final 12 months, Google scored a silver medal at IMO utilizing a “formal” system, which means it required people to translate issues right into a machine‑readable format. This 12 months, each OpenAI and Google entered “casual” methods into the competitors, which had been capable of ingest questions and generate proof‑primarily based solutions in pure language. Each firms declare their AI fashions appropriately answered 5 out of six questions on IMO’s check, scoring larger than most highschool college students and Google’s AI mannequin from final 12 months, with out requiring any human-machine translation.

In interviews with TechCrunch, researchers behind OpenAI and Google’s IMO efforts claimed that these gold-medal performances symbolize breakthroughs round AI reasoning fashions in non-verifiable domains. Whereas AI reasoning fashions are likely to do properly on questions with simple solutions, similar to simple arithmetic or coding duties, these methods battle on duties with extra ambiguous options, similar to shopping for an awesome chair or serving to with advanced analysis.

Nevertheless, Google is elevating questions round how OpenAI carried out and introduced its gold-medal IMO efficiency. In any case, if you happen to’re going to enter AI fashions right into a math contest for top schoolers, you may as properly argue like youngsters.

Shortly after OpenAI introduced its feat on Saturday morning, Google DeepMind’s CEO and researchers took to social media to slam OpenAI for asserting its gold medal prematurely — shortly after IMO introduced which excessive schoolers had received the competitors on Friday night time — and for not having their mannequin’s check formally evaluated by IMO.

Thang Luong, a Google DeepMind senior researcher and lead for the IMO venture, advised TechCrunch that Google waited to announce its IMO outcomes to respect the scholars collaborating within the competitors.

Techcrunch occasion

San Francisco
|
October 27-29, 2025

Luong mentioned that Google has been working with IMO’s organizers since final 12 months in preparation for the check and wished to have the IMO president’s blessing and official grading earlier than asserting its official outcomes, which it did on Monday morning.

“The IMO organizers have their grading guideline,” Luong mentioned. “So any analysis that’s not primarily based on that guideline couldn’t make any declare about gold-medal stage [performance].”

Noam Brown, a senior OpenAI researcher who labored on the IMO mannequin, advised TechCrunch that IMO reached out to OpenAI a number of months in the past about collaborating in a proper math competitors, however the ChatGPT-maker declined as a result of it was engaged on pure language methods that it thought had been extra price pursuing. Brown says OpenAI didn’t know IMO was conducting a casual check with Google.

OpenAI says it employed third-party evaluators — three former IMO medalists who understood the grading system — to grade its AI mannequin’s efficiency. After OpenAI realized of its gold-medal rating, Brown mentioned the corporate reached out to IMO, which then advised the corporate to attend to announce till after IMO’s Friday night time award ceremony.

IMO didn’t reply to TechCrunch’s request for remark.

Google isn’t essentially flawed right here — it did undergo a extra official, rigorous course of to realize its gold-medal rating — however the debate might miss the larger image: AI fashions from a number of main AI labs are enhancing rapidly. Nations from world wide despatched their brightest college students to compete at IMO this 12 months, and only a few p.c of them scored in addition to OpenAI and Google’s AI fashions did.

Whereas OpenAI used to have a major lead over the trade, it definitely feels as if the race is extra carefully matched than any firm want to admit. OpenAI is anticipated to launch GPT-5 within the coming months, and the corporate definitely hopes to present off the impression that it nonetheless leads the AI trade.


Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles