Tech

OpenAI’s Deep Research smashes records for the world’s hardest AI exam, with ChatGPT o3-mini and DeepSeek left in its wake


  • The accuracy achieved by the top-scoring AI on the world’s hardest benchmark has improved by 183% in just two weeks
  • ChatGPT o3-mini now scores up to 13% accuracy depending on capacity
  • OpenAI Deep Research obliterates competition with 26.6% accuracy result
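
The 183% figure is a relative improvement, not a gain in percentage points. As a rough sketch, assuming the 183% is measured against the previous top score (which the summary above does not state), the implied earlier score can be backed out from the 26.6% result. The function names here are illustrative only.

```python
def implied_baseline(new_score: float, relative_gain_pct: float) -> float:
    """Return the earlier score implied by a new score and a relative gain (in percent)."""
    return new_score / (1 + relative_gain_pct / 100)


def relative_gain(old_score: float, new_score: float) -> float:
    """Return the relative improvement from old_score to new_score, in percent."""
    return (new_score - old_score) / old_score * 100


# Figures taken from the summary: 26.6% accuracy, 183% improvement.
# Assumption: the 183% is relative to the previous leaderboard best.
baseline = implied_baseline(26.6, 183)
print(round(baseline, 1))         # implied earlier top score, in percent
print(round(26.6 - baseline, 1))  # the same jump expressed in percentage points
```

Under that assumption, the earlier top score works out to roughly 9.4%, so the jump is about 17 percentage points.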

The world’s hardest AI exam, Humanity’s Last Exam, was launched less than two weeks ago, and we’ve already seen a huge jump in accuracy, with first ChatGPT o3-mini and now OpenAI’s Deep Research topping the leaderboard.

The AI benchmark, created by experts from around the world, contains some of the hardest reasoning problems and questions known to man. It’s so hard that when I previously wrote about Humanity’s Last Exam, I couldn’t even understand one of the questions, let alone answer it.
