Techsoma

OpenAI’s Latest Models Hallucinate More: A Step Back for AI Reliability

by Leslie Finecountry
April 20, 2025
in Artificial Intelligence

In a surprising development, OpenAI’s newest o3 and o4-mini AI models hallucinate more frequently than their predecessors, reversing the expected trend of improving accuracy.

The Hallucination Problem

OpenAI’s internal testing shows concerning results:

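  • The o3 model hallucinated on 33% of questions in PersonQA, OpenAI’s in-house benchmark of questions about people, roughly double the rate of the earlier o1 (16%) and o3-mini (14.8%) reasoning models
  • The o4-mini performed worse on the same benchmark, hallucinating 48% of the time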
  • Even OpenAI admits it doesn’t fully understand why this is happening

Third-party testing by Transluce, a nonprofit AI research lab, found related problems, including examples where o3 fabricated actions it claimed to have taken, such as running code on a laptop outside of ChatGPT, which the model cannot do.

Why This Matters

This regression in factual reliability creates significant challenges for industries requiring accuracy:

  • Legal firms can’t risk models inserting errors into contracts
  • Financial institutions need reliable analysis without fabricated data
  • Healthcare applications demand extremely high levels of accuracy

Even in areas where the models excel, problems persist. Workera CEO Kian Katanforoosh reports that while o3’s coding capabilities are impressive, it regularly generates broken website links.

The Reasoning Model Trade-off

The industry has pivoted to “reasoning models” as traditional approaches to improving AI showed diminishing returns. Reasoning improves performance on many tasks without requiring massive amounts of computing and data during training, but these models also appear to make more claims overall, which means more accurate claims and more inaccurate ones.

Potential Solutions

OpenAI is exploring several approaches:

  • Web search integration (GPT-4o with web search achieves 90% accuracy on SimpleQA, another of OpenAI’s accuracy benchmarks)
  • Specialized training techniques to reduce hallucinations

“Addressing hallucinations across all our models is an ongoing area of research,” said OpenAI spokesperson Niko Felix.

What’s Next

If scaling reasoning models continues to worsen hallucinations, finding solutions becomes increasingly urgent. For now, users should maintain appropriate skepticism about factual claims and implement verification processes—particularly for critical applications.
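One concrete verification step, given the broken links mentioned above, is to check every URL in a model's answer before trusting it. The following is a minimal sketch in Python, not a complete fact-checking pipeline. The function names (`extract_urls`, `url_resolves`, `find_broken_links`) and the URL pattern are illustrative assumptions, and a link that resolves can still point to a page that does not support the claim.

```python
import http.client
import re
from typing import Callable, List
from urllib.request import Request, urlopen

# Simplified pattern (an assumption): matches http(s) URLs up to whitespace,
# a closing bracket, or a quote. Real-world URL parsing has more edge cases.
URL_PATTERN = re.compile(r"https?://[^\s)\]>\"']+")


def extract_urls(text: str) -> List[str]:
    """Return the URLs found in text, in order, without trailing punctuation."""
    return [url.rstrip(".,;:!?") for url in URL_PATTERN.findall(text)]


def url_resolves(url: str, timeout: float = 5.0) -> bool:
    """Return True if a HEAD request to url succeeds with a status below 400."""
    try:
        request = Request(url, method="HEAD")
        with urlopen(request, timeout=timeout) as response:
            return response.status < 400
    except (OSError, ValueError, http.client.HTTPException):
        # Covers HTTP errors, DNS failures, timeouts, and malformed URLs.
        return False


def find_broken_links(
    text: str, check: Callable[[str], bool] = url_resolves
) -> List[str]:
    """Return the URLs in text for which check(url) is False."""
    return [url for url in extract_urls(text) if not check(url)]
```

Passing a different `check` function makes it possible to test the logic offline or to swap in a stricter check. Some servers reject HEAD requests, so a production version would likely need a GET fallback.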

#OpenAI #AIHallucinations #TechNews #AIReliability

Copyright 2026 Techsoma Middle East. All rights reserved.