Google introduced a new feature for its AI model Gemini 2.5 that lets users highlight and analyze parts of images with plain text queries. Users can interact with the model in natural language, including complex queries such as “person with an umbrella” or “everyone who is not sitting.” Gemini recognizes not only concrete objects but also abstract concepts like “clutter” or “damage,” and it can find elements based on text that appears in the image.

The feature supports multilingual queries and can return captions for objects in other languages. Results come back as coordinates of the selected region, pixel masks, and captions, so the relevant part of the image can be identified quickly. No separate tools or models are needed; everything is handled by the Gemini model itself.
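To make that output format concrete, the snippet below sketches what a single detection entry might look like, assuming the model is asked to return structured JSON; the field names and values are illustrative, not confirmed by the announcement.

```python
# Illustrative only: one plausible shape of a single detection entry.
# Field names (box_2d, mask, label) and the coordinate values are assumptions.
detection = {
    "box_2d": [180, 240, 760, 520],            # coordinates of the selected region
    "mask": "<base64-encoded PNG pixel mask>",  # per-pixel mask for the object
    "label": "person with an umbrella",         # caption describing the match
}
```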
Developers can access the new capability through the Gemini API. Google recommends the “gemini-2.5-flash” model with the “thinkingBudget” parameter set to zero for faster responses. Initial experiments can be run in Google AI Studio or in a Python Colab notebook.
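A minimal request could look like the sketch below, assuming the google-genai Python SDK; the image path, prompt wording, and API key handling are placeholders rather than part of the announcement.

```python
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

# Load the image to analyze (the sample path is illustrative).
image = types.Part.from_bytes(
    data=open("construction_site.jpg", "rb").read(),
    mime_type="image/jpeg",
)

response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents=[
        image,
        "Highlight all people on the construction site without helmets. "
        "Return a JSON list of objects with box_2d, mask, and label fields.",
    ],
    # Setting the thinking budget to zero trades reasoning depth for speed,
    # as recommended for fast responses.
    config=types.GenerateContentConfig(
        thinking_config=types.ThinkingConfig(thinking_budget=0)
    ),
)

print(response.text)  # model output with coordinates, masks, and captions
```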
The feature will be useful for designers, who can now highlight details in photos with simple commands, such as “highlight the building’s shadow.” In the field of occupational safety, Gemini will help identify violations, such as “all people on the construction site without helmets.” In insurance, this capability allows for automatic marking of damaged buildings on aerial photographs, saving time during damage assessment.