By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

OpenAI unveils new AI models o3 and o3-mini

The o3 model approaches AGI, demonstrating high test results but struggling with simple tasks

Igor Lev
Igor Lev
Published: 22.12.2024
News
OpenAI
SHARE

On the final day of its 12-day event, OpenAI introduced a new AI model for reasoning tasks — o3, the successor to the o1 model. Alongside it, a compact version — o3-mini — was presented, designed for specific tasks. This release promises a significant breakthrough in the ability to model cognitive processes.

o3, our latest reasoning model, is a breakthrough, with a step function improvement on our hardest benchmarks. we are starting safety testing & red teaming now. https://t.co/4XlK1iHxFK

— Greg Brockman (@gdb) December 20, 2024

OpenAI states that o3, under certain conditions, approaches AGI — a system capable of performing most economically significant tasks typically done by humans. Although the company emphasizes that this is not yet a definitive breakthrough, o3’s test results significantly surpass previous OpenAI models. In the ARC-AGI test, which evaluates an AI’s ability to acquire new skills beyond its training data, o3 scored 87.5% in high-compute mode, tripling o1’s performance in the lowest mode.

The model achieved outstanding results in various tests: 96.7% on the 2024 American Mathematics Exam, 87.7% in GPQA Diamond, answering graduate-level questions in biology, physics, and chemistry, and set a new record of 25.2% in the Frontier Math test by EpochAI. Despite these achievements, experts such as ARC-AGI co-author François Chollet caution against overestimating these results, pointing to o3’s struggles with simple tasks and the high costs of using its advanced modes.

Another significant improvement in o3 is the ability to adjust computation time, allowing users to choose low, medium, or high modes depending on task complexity. The model uses a “private chain of thought” process, enabling it to internally analyze tasks, explain its reasoning, and provide more reliable results in fields such as physics, mathematics, and programming.

OpenAI acknowledges potential risks associated with o3, given issues found in the previous model. OpenAI teams are now applying a “discriminative alignment” technique to ensure o3’s compliance with safety principles. To minimize risks, OpenAI will first make o3-mini available for testing by safety researchers, while o3 will become available later in 2025. CEO Sam Altman also advocates for the creation of a federal testing system to assess the potential impact of such models.

ChatGPT users received expanded AI model selection settings
OpenAI allowed the choice between GPT-5 and GPT-4o
OpenAI launches GPT-5 with adaptive architecture for ChatGPT
ChatGPT now reminds you that you have been working with AI for too long
The new GPT-5 from OpenAI promises better task automation
TAGGED:AGIOpenAI
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Gemini 2.5 Deep Think
Google DeepMind launches Gemini 2.5 Deep Think for Ultra plan subscribers
02.08.2025
Mistral AI
Mistral offers the open language model Voxtral for speech
17.07.2025
Image from Google site
AlphaEarth Foundations creates accurate maps of the Earth in minutes
31.07.2025
ChatGPT Agent
OpenAI introduced ChatGPT Agent, allowing AI to delegate complex tasks
18.07.2025
AI tries to do it all
OpenAI urges caution when using the ChatGPT agent
19.07.2025

Читайте також

Closed chats and search engines
News

OpenAI removes the ability to index open ChatGPT chats on Google

01.08.2025
Learning with AI
News

ChatGPT received Study Mode for user learning mode

30.07.2025
Copilot
News

Microsoft prepares its Copilot service for the launch of GPT-5

25.07.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?