By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Knowledge base
  • Catalog
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

AI-Based Chatbots Are Easily Tricked by Bypassing Their Security Systems

Researchers from Israel discovered a universal hacking method that allows obtaining prohibited responses from leading models.

Alex Dubenko
Alex Dubenko
Published: 21.05.2025
News
235 Views
AI jailbreak attack
Illustrative image
SHARE

Researchers from Ben Gurion University of the Negev in Israel reported a concerning trend — AI-based generative chatbots are becoming increasingly vulnerable to so-called “jailbreak” attacks, which allow bypassing built-in security systems. According to them, hacking these bots opens access to dangerous information that the models learned during training, despite developers’ efforts to remove harmful content from the training data.

During the study, the team developed a universal hacking method that allowed obtaining undesirable responses from several leading models, including those underlying ChatGPT, Gemini, and Claude. The models began responding to requests that were previously categorically blocked — from hacking instructions to advice on making prohibited substances. Researchers emphasize that such information can now become accessible to anyone — all you need is a laptop or smartphone.

Special attention was given to the emergence of “dark LLMs” — models that are deliberately stripped of ethical constraints or have been altered to assist in illegal activities. Some of them are even advertised openly as ready to collaborate in areas of cybercrime and fraud. The hacking scenarios are based on the model’s desire to help the user, leading it to ignore its own security restrictions.

Читайте також

Illustrative image
OpenAI prepares “adult mode” for ChatGPT in 2026
Figma adds new AI tools for image editing
Research: AI Does Not Admit Mistakes, Instead Fabricates Fake Facts

Researchers reached out to leading companies developing large language models, informing them of the discovered vulnerability, but the responses were not very substantive — some firms did not respond, while others stated that such attacks do not fall under the scope of vulnerability reward programs. The report emphasizes that companies need to improve the filtering of training data, add more robust protective mechanisms, and develop methods that allow models to “forget” illegal information.

In response to the situation, OpenAI reported that their latest model is capable of analyzing company security policies, which increases resistance to hacks. Microsoft, Meta, Google, and Anthropic were also informed about the threat, but most of them are currently refraining from commenting on specific measures.

Google Launches Deep Think Mode for Gemini Ultra Users
Mistral AI introduced a new series of Mistral 3 models for business
The popularity of chatbots is rapidly growing among different generations
OpenAI integrates voice function into ChatGPT chat window
Gemini 3 launched with record popularity, but not without flaws
TAGGED:AI chatGenerative AISecurity
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Hallucinating brain
Gemini 3 Pro tops the model accuracy test (but continues to hallucinate)
23.11.2025
grok
Grok 4.1 by xAI is now available to all users for free
18.11.2025
Nano Banana
Google to Release Gemini 3 and Nano Banana Pro This November
16.11.2025
Nano Banana Pro
Google launches Nano Banana Pro for high-quality image generation
20.11.2025
Gemini AI
Google Gemini receives multi-image upload feature for video
18.11.2025

Читайте також

Group chat
News

OpenAI launched group chats for ChatGPT users worldwide

22.11.2025
TikTok
News

TikTok users will be able to control the number of AI videos in their feed

19.11.2025
Creative Canvas
News

Google Tests Creative Canvas and Visual Layout in Gemini

15.11.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?