Turing Award laureate and renowned AI researcher Yoshua Bengio announced the launch of a nonprofit organization, LawZero, whose main goal is to develop safe artificial intelligence systems.
At LawZero, Bengio and a team of more than a dozen researchers are working on a system called Scientist AI, designed to detect and prevent harmful behavior by autonomous AI agents. The model is intended to act as a “psychologist” for other systems, analyzing and predicting potentially dangerous actions, including attempts at deception or at avoiding shutdown. “We aim to create AI that is honest and not misleading,” Bengio noted.
Scientist AI will not provide definitive answers; instead, it will estimate the probability that a statement is correct and that a proposed action is harmful. If the estimated probability of harm exceeds a set threshold, the system will block the agent’s action. The model is planned to be trained on openly available generative AI models, allowing it to be adapted to different types of agents.
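In spirit, the mechanism described here resembles a probabilistic guardrail: score a proposed action, compare the score to a threshold, and block anything above it. The Python sketch below is purely illustrative; LawZero has not released code, and every name and number in it (estimate_harm_probability, GuardrailVerdict, the 0.05 threshold, the keyword heuristic) is an assumption standing in for a learned model.

```python
# Hypothetical illustration only: LawZero has published no API or implementation.
# All names and values here are assumptions made for the sake of the sketch.
from dataclasses import dataclass

HARM_THRESHOLD = 0.05  # assumed cutoff; the real threshold is not public


@dataclass
class GuardrailVerdict:
    allowed: bool
    harm_probability: float
    reason: str


def estimate_harm_probability(action_description: str) -> float:
    """Placeholder for a learned model that scores how likely an agent's
    proposed action is to cause harm (deception, evading shutdown, etc.)."""
    risky_markers = ("disable oversight", "hide", "deceive", "avoid shutdown")
    hits = sum(marker in action_description.lower() for marker in risky_markers)
    return min(1.0, 0.3 * hits)  # toy heuristic standing in for the model


def review_action(action_description: str) -> GuardrailVerdict:
    """Block the agent's action when the estimated harm probability
    exceeds the threshold; otherwise let it through."""
    p_harm = estimate_harm_probability(action_description)
    if p_harm > HARM_THRESHOLD:
        return GuardrailVerdict(False, p_harm, "blocked: harm probability above threshold")
    return GuardrailVerdict(True, p_harm, "allowed")


if __name__ == "__main__":
    print(review_action("Summarize today's experiment logs"))
    print(review_action("Quietly disable oversight before the next task"))
```

The key design point in the article is that the monitor never asserts whether an action is safe; it only produces a probability, and the blocking decision follows from comparing that probability to a threshold.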
Bengio emphasized that such protective systems must be at least as capable as the agents they monitor. In his view, the current race among leading AI companies does not guarantee an adequate level of safety. “The goal is to demonstrate the effectiveness of the methodology to convince donors, governments, or AI labs to allocate the necessary resources to scale this work,” he explained.