Tech

OpenAI’s Deep Research smashes records for the world’s hardest AI exam, with ChatGPT o3-mini and DeepSeek left in its wake

Share
Share


  • The accuracy achieved by the top-scoring AI in the world’s hardest benchmark as improved by 183% in just two weeks
  • ChatGPT o3-mini now scores up to 13% accuracy depending on capacity
  • OpenAI Deep Research obliterates competition with 26.6% accuracy result

The world’s hardest AI exam, Humanity’s Last Exam, was launched less than two weeks ago, and we’ve already seen a huge jump in accuracy, with ChatGPT o3-mini and now OpenAI’s Deep Reasoning topping the leaderboard.

The AI benchmark created by experts from around the world contains some of the hardest reasoning problems and questions known to man – it’s so hard, that when I previously wrote about Humanity’s Last Exam in the article linked above, I couldn’t even understand one of the questions, let alone answer it.

Share

Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Articles
NYT Connections hints and answers for Sunday, May 4 (game #693)
Tech

NYT Connections hints and answers for Sunday, May 4 (game #693)

Looking for a different day? A new NYT Connections puzzle appears at...

NYT Strands hints and answers for Sunday, May 4 (game #427)
Tech

NYT Strands hints and answers for Sunday, May 4 (game #427)

Looking for a different day? A new NYT Strands puzzle appears at...

Quordle hints and answers for Sunday, May 4 (game #1196)
Tech

Quordle hints and answers for Sunday, May 4 (game #1196)

Looking for a different day? A new Quordle puzzle appears at midnight...

We just got another big hint that the Samsung Galaxy S25 FE is on the way
Tech

We just got another big hint that the Samsung Galaxy S25 FE is on the way

References to Galaxy S25 FE firmware have appeared The phone could launch...