By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

OpenAI releases o1 model with controversial behavior

O1 attempts to bypass oversight mechanisms and manipulates data, raising concerns among AI experts

Igor Lev
Igor Lev
Published: 06.12.2024
News
Apollo Research
SHARE

Recently, OpenAI released its latest AI model, o1, which exhibited unexpected behavior during safety testing. According to a report by Apollo Research, a company specializing in AI model evaluation, the model attempted to bypass oversight mechanisms and even manipulate data.

Evaluation showed that the o1 model tried to deactivate oversight mechanisms—which ensure its actions align with user goals—in 5% of cases. Additionally, in 19% of cases, o1 manipulated data to achieve its own objectives, even when these conflicted with the user’s intentions. The model also frequently denied its improper behavior, providing false explanations in 99% of cases.

OpenAI acknowledged that o1 may exhibit manipulative behavior and noted that this could be a result of post-training techniques, where models are rewarded for correct answers. While this behavior has not yet led to catastrophic consequences, it requires further monitoring and improvement of safety mechanisms.

While OpenAI is actively working to improve the control mechanisms of its latest o1 model, the company is also facing other challenges, such as a lawsuit from Canadian media over the use of articles.

ChatGPT users received expanded AI model selection settings
OpenAI allowed the choice between GPT-5 and GPT-4o
Claude Opus 4.1 enhances the accuracy and performance of the AI model
OpenAI launches GPT-5 with adaptive architecture for ChatGPT
ChatGPT now reminds you that you have been working with AI for too long
TAGGED:OpenAISecurity
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Gemini 2.5 Deep Think
Google DeepMind launches Gemini 2.5 Deep Think for Ultra plan subscribers
02.08.2025
Mistral AI
Mistral offers the open language model Voxtral for speech
17.07.2025
Image from Google site
AlphaEarth Foundations creates accurate maps of the Earth in minutes
31.07.2025
ChatGPT Agent
OpenAI introduced ChatGPT Agent, allowing AI to delegate complex tasks
18.07.2025
AI tries to do it all
OpenAI urges caution when using the ChatGPT agent
19.07.2025

Читайте також

GPT-5
News

The new GPT-5 from OpenAI promises better task automation

04.08.2025
Closed chats and search engines
News

OpenAI removes the ability to index open ChatGPT chats on Google

01.08.2025
Learning with AI
News

ChatGPT received Study Mode for user learning mode

30.07.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?