Craftium.AI
© 2024-2025 Craftium.AI.

Claude Opus 4 to Receive Feature for Ending Harmful Conversations

The feature activates only in response to extremely abusive requests and does not trigger when there is a risk of self-harm.

Eleni Karasidi
Published: 17.08.2025
News
Illustrative image from anthropic.com.

Anthropic has introduced a feature that allows its latest and largest AI models to end conversations in rare, extreme cases of persistently harmful or abusive interactions with users. The company emphasizes that the feature is intended not to protect people but to protect the AI model itself. It applies to Claude Opus 4 and 4.1 and is triggered only when users request sexual content involving minors or seek information that would enable large-scale violence or terrorist acts.

Anthropic notes that during testing, Claude Opus 4 was reluctant to respond to such requests and showed clear signs of unwillingness to continue the conversation. The model ends a dialogue only as a last resort, after several of its attempts to redirect the conversation have failed and there is no hope of a productive interaction, or when the user explicitly asks it to end the chat.

The company reports that Claude will not use this feature if there is a risk that the user may harm themselves or others. After a conversation is ended, users can still start a new conversation from the same account or create a new branch of the ended conversation by editing their earlier messages.

Anthropic considers this feature an experiment and plans to further refine the approach. The company is also exploring the issue of “model well-being” and testing various ways to reduce potential risks to its AI models in the future.

Tagged: AI assistant, Anthropic, Claude AI, Security