By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

Alibaba presented new AI models Qwen2.5-VL with unique capabilities

The models outperform competitors, can analyze text, images, and interact with applications on various devices

Eleni Karasidi
Eleni Karasidi
Published: 28.01.2025
News
Qwen Chat
Illustrative image
SHARE

The Alibaba team has presented a new series of AI models called Qwen2.5-VL. These models can perform a variety of tasks involving text and image analysis, including object recognition in images, document analysis, and video understanding. The models can also control PCs, similar to the functionality of the Operator model from OpenAI. According to test results, Qwen2.5-VL outperforms GPT-4o from OpenAI, Claude 3.5 Sonnet from Anthropic, and Gemini 2.0 Flash from Google.

Qwen2.5-VL is available for testing in the Qwen Chat app and on the Hugging Face platform. It can analyze graphs and charts, extract data from scanned invoices and forms, and also understand videos several hours long. The model can recognize IP from movies and TV series, as well as various products, which suggests possible training on copyrighted materials.

Read also

Hugging Face Sheets
A New Way to Create Tables with AI from Hugging Face
Perplexity Launches Labs for Pro Users
ChatGPT’s Accuracy in Guessing Photo Locations Amazes Users

One of the interesting features of Qwen2.5-VL is its ability to interact with software on PCs and mobile devices. For example, it can launch applications and perform tasks such as booking flights through mobile apps. This opens up new possibilities for automation and simplifying the use of various services.

The Qwen2.5-VL series includes several models, of which two smaller ones, Qwen2.5-VL-3B and Qwen2.5-VL-7B, are available under a liberal license. The most powerful model, Qwen2.5-VL-72B, has a special license from Alibaba.

Qwen Chat now speaks and can see through the camera
Alibaba unveils QwQ-Max-Preview with advanced “thinking”
Google Gemini now allows you to upload and analyze files
Qwen Chat
Alibaba unveils new version of Qwen Chat with AI features
TAGGED:AlibabaData analysisImage analysisQwen
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

SB1
Soundboard SB1 by ElevenLabs — Creating Music and Effects on the Fly
18.05.2025
Collective language formation
Artificial Intelligence Creates Its Own Language Rules in Groups
15.05.2025
Codex
New Codex Agent from OpenAI Expands ChatGPT Capabilities
16.05.2025
Google Beam
3D Video Meetings Become Reality with Google Beam
21.05.2025
AI worker stress
Generative AI Negatively Affects Employee Motivation
15.05.2025

Читайте також

Демон
Blog

Qwen Chat adds free video and image generation features

26.01.2025
Qwen Chat
News

Alibaba unveils its own web app Qwen Chat

13.01.2025
QwQ-32B-Preview
News

Alibaba introduced QwQ-32B-Preview — a model for logical tasks

28.11.2024

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe to our weekly digest of news, guides, and reviews about AI. Get fresh content delivered straight to your inbox!

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?