
OpenAI’s Latest Models Hallucinate More: A Step Back for AI Reliability

by Leslie Finecountry
April 20, 2025
in Artificial Intelligence

In a surprising development, OpenAI’s newest o3 and o4-mini AI models hallucinate more frequently than their predecessors, reversing the expected trend of improving accuracy.

The Hallucination Problem

OpenAI’s internal testing shows concerning results:

  • The o3 model hallucinated on 33% of questions about people, roughly double the rate of previous models
  • The o4-mini model performed worse, hallucinating on 48% of those questions
  • Even OpenAI admits it doesn’t fully understand why this is happening

Third-party testing by Transluce confirmed these issues, finding examples where o3 fabricated processes it claimed to have used, such as running code on external devices.

Why This Matters

This regression in factual reliability creates significant challenges for industries requiring accuracy:

  • Legal firms can’t risk models inserting errors into contracts
  • Financial institutions need reliable analysis without fabricated data
  • Healthcare applications demand extremely high levels of accuracy

Even in areas where the models excel, problems persist. Workera CEO Kian Katanforoosh reports that while o3’s coding capabilities are impressive, it regularly generates broken website links.

The Reasoning Model Trade-off

The industry has pivoted to “reasoning models” as traditional approaches showed diminishing returns. These models improve performance without requiring massive computing resources, but they appear to make more claims overall, which means more accurate claims and more inaccurate ones.

Potential Solutions

OpenAI is exploring several approaches:

  • Web search integration (GPT-4o with search achieves 90% accuracy on some benchmarks)
  • Specialized training techniques to reduce hallucinations

“Addressing hallucinations across all our models is an ongoing area of research,” said OpenAI spokesperson Niko Felix.

What’s Next

If scaling reasoning models continues to worsen hallucinations, finding solutions becomes increasingly urgent. For now, users should maintain appropriate skepticism about factual claims and implement verification processes—particularly for critical applications.

#OpenAI #AIHallucinations #TechNews #AIReliability



© 2025 — Techsoma Middle East. All Rights Reserved
