Apple Unveils Details of the New Manzano Model for Images

Manzano shows high performance in complex analysis and generation tasks, but is not yet available for widespread use

Published: 30.09.2025

150 Views

Illustrative image.

Apple has introduced a research paper on a new AI model for working with images called Manzano. It is capable of both recognizing and generating images, which is usually a challenging task for open models. The company published the results of Manzano’s tests on complex queries, comparing it with systems like Deepseek Janus Pro, GPT-4o, and Google’s Gemini 2.5 Flash Image Generation.

Apple Unveils Details of the New Manzano Model for Images — Apple Image

Manzano uses a hybrid image tokenizer that provides two types of tokens. Continuous tokens help the model better understand images, while discrete tokens help generate them. Both streams are formed by a shared encoder, reducing conflicts between image analysis and generation tasks.

The Manzano architecture includes a hybrid tokenizer, a unified language model, and a separate image decoder. Apple has created several versions of the decoder with varying numbers of parameters, allowing it to work with resolutions from 256 to 2048 pixels. For training, researchers used over two billion “image-text” pairs and one billion “text-image” pairs from public and internal sources.

In Apple’s tests, the Manzano model showed better results on tasks involving diagram analysis, documents, and other tasks requiring extensive text work. Versions with more parameters demonstrate higher quality performance compared to smaller ones. Manzano confidently handles image generation tasks, stylization, editing, adding new elements, and depth estimation.

Apple believes that the modular structure of Manzano will allow for independent updates of individual components and the application of different training approaches. The model is not yet available for public use, but the company plans to develop its own AI solutions and use GPT-5 from OpenAI in Apple Intelligence, starting with iOS 26.

TAGGED:Apple Image analysis Image generation Manzano

Apple Unveils Details of the New Manzano Model for Images

Leave a Reply Cancel reply

Follow us

Popular News

Sora by OpenAI now available for Android users in seven countries

Google Showcases First AI-Created TV Commercial

OpenAI prepares GPT-5.1 for complex user tasks

ElevenLabs launched a platform for licensed celebrity voices

Google launches Pomelli service for creating AI-driven advertising campaigns

Navigation

Useful

Read also

Leave a Reply Cancel reply

Follow us

Popular News

Читайте також

Level Up with AI!