ChatGPT is getting smarter, but its hallucinations are spiraling

  • OpenAI’s latest AI models, GPT o3 and o4-mini, hallucinate significantly more often than their predecessors
  • The increased complexity of the models may be leading to more confident inaccuracies
  • The high error rates raise concerns about AI reliability in real-world applications

Brilliant but untrustworthy people are a staple of fiction (and history). The same correlation may apply to AI, according to an OpenAI investigation reported by The New York Times. Hallucinations, imaginary facts, and straight-up lies have been part of AI chatbots since they were created. In theory, improvements to the models should make them less frequent.

OpenAI’s latest flagship models, GPT o3 and o4-mini, are meant to mimic human logic. Where their predecessors mainly focused on fluent text generation, OpenAI built GPT o3 and o4-mini to think things through step by step. OpenAI has boasted that o1 could match or exceed the performance of PhD students in chemistry, biology, and math. But OpenAI’s report highlights some harrowing results for anyone who takes ChatGPT responses at face value.

