By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Knowledge base
  • Catalog
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

Alibaba Model Can Clone Voice from 3 Seconds of Audio

New tools allow easy creation of personalized assistants and voice-overs, working with multiple languages and styles

Eleni Karasidi
Eleni Karasidi
Published: 24.12.2025
News
17 Views
Qwen
Illustrative image.
SHARE

The Qwen team from Alibaba Cloud has introduced two new AI models that allow creating or copying voices using text commands. Both models can generate speech based on text and reproduce a voice similar to the original after listening to just three seconds of audio.

Users can input text, and the system converts it into speech with specified characteristics. A short audio fragment is sufficient for voice cloning, making the process quick and convenient. The models support various languages, including English and Chinese, and work with intonation and speech style.

Read Also

Illustrative image
Alibaba released Qwen-Image-Layered for layered image generation
Google updated Gemini 2.5 for audio translation in Translate
Google Introduced Updated Gemini 2.5 Models for Voice Synthesis

Developers reported that these AI models can be used to create personalized voice assistants, voice-over for videos or audiobooks, as well as for educational and entertainment applications. The service is aimed at a wide audience, including developers and regular users.

Alibaba Cloud plans to further enhance these tools and expand their features, focusing on user data security and protection. New capabilities are already available for testing through the company’s official channels.

Alibaba introduced Live Avatar for creating interactive avatars online
Alibaba released the lightweight Z-Image-Turbo model for uncensored image generation
Alibaba announced a compact model Z-Image for image generation
Qwen-Image-Edit-2509 allows editing multiple images simultaneously
ElevenLabs launched a platform for licensed celebrity voices
TAGGED:AlibabaQwenVoice cloningVoice generation
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Frame of video generated in Runway Gen-4
Runway presented five new features for AI video models
14.12.2025
Illustrative image
OpenAI prepares “adult mode” for ChatGPT in 2026
12.12.2025
FLUX.2
Black Forest Labs introduced FLUX.2 models for image creation
27.11.2025
Illustrative image
ShengShu Technology Introduces Enhanced Generation Capabilities in Vidu Q2
04.12.2025
Image generated in Hazelnut
OpenAI may be preparing a new image generation model — first test results
10.12.2025

Читайте також

Voiceover with ChatGPT
Guides

How to create a voiceover using ChatGPT

07.11.2025
Illustrative image
Collections

Best Chrome Extensions for Downloading ChatGPT Voice Responses

01.10.2025
Frame from a video generated in Sora 2
News

OpenAI launched the Sora 2 model, which allows creating videos with sound

01.10.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?