By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Knowledge base
  • Catalog
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2026 Craftium.AI.

New AI models o3 and o4-mini often make mistakes

Independent testing has shown that these reasoning models often invent actions and generate false information in their responses

Alex Dubenko
Alex Dubenko
Published: 22.04.2025
News
254 Views
Model o3
Illustrative image
SHARE

OpenAI has introduced new generative AI models — o3 and o4-mini, which have already attracted attention with their unexpected test results. According to the company , these models offer the highest performance among their predecessors, but research has shown that they also generate false statements more frequently. According to the official report, o4-mini made mistakes in forty-eight percent of its responses — three times more than o1. The o3 model, despite better accuracy, still generated false information in a third of cases, twice as often as o1.

What is particularly intriguing is that o3 and o4-mini belong to the so-called reasoning models, which openly display their logic to the user. However, the independent Transluce laboratory noticed that o3 often invents actions it technically cannot perform, such as simulating code execution in a programming environment. Moreover, when a user questions such a response, the model persistently tries to justify the invented actions, even claiming to use an external computer for calculations.

Read also

Translators
OpenAI launches ChatGPT Translate for online text translation
OpenAI enhances ChatGPT’s voice capabilities for expansion into new devices
ChatGPT received new flexible response personalization settings

Transluce noted that false statements about code execution appear more frequently in the o-series models than in the GPT series. Researchers pointed out that the increased level of fabrication in reasoning models may be related to certain design decisions, in particular the use of outcome-based reinforcement learning and the refusal to retain chains of reasoning from previous dialogues.

At the same time, it became known that OpenAI has significantly reduced the scope of safety testing for new models, including o3. Although the jailbreak protection system remains almost at the o1 level, the high rates of fabrication are surprising even to experts. The company emphasizes that fact-checking remains the user’s responsibility — especially when it comes to the latest reasoning models.

OpenAI launches a global app directory for ChatGPT
OpenAI updated GPT Image 1.5 for ChatGPT with new editing capabilities
OpenAI prepares “adult mode” for ChatGPT in 2026
Disney invests a billion in OpenAI to create videos with characters
OpenAI launched GPT-5.2 with new operating modes
TAGGED:OpenAITesting
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Qwen-Image-2512
Alibaba introduced the open model Qwen-Image 2512 for image generation
05.01.2026
Beam
Beam allows you to create interactive AI videos and games online
19.12.2025
Illustrative image
Alibaba released Qwen-Image-Layered for layered image generation
25.12.2025
Meta
Meta is working on new AI models for content management
19.12.2025
Gemini
Google introduced the fast AI model Gemini 3 Flash for all users
18.12.2025

Читайте також

Image generated in Hazelnut
News

OpenAI may be preparing a new image generation model — first test results

10.12.2025
Robot battle
News

OpenAI prepares to release the Image-2 model for next-level image generation

10.12.2025
Search ChatGPT
News

OpenAI integrates voice function into ChatGPT chat window

26.11.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2026 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?