By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

ShengShu Technology presented the Vidu Q1 model

The new product enables video generation from two images and text with instantly integrated multilayer audio and improved anime.

Eleni Karasidi
Eleni Karasidi
Published: 26.04.2025
News
Vidu Q1
Frame from the presentation video
SHARE

On April 21, ShengShu Technology presented Vidu Q1 — a browser-based AI model that allows users to create five-second 1080p videos from two images and a text description. Thanks to the “First-to-Last Frame” approach, movements in the clip remain consistent even if the source images are unrelated, opening up new possibilities for independent editing with smooth scene transitions.

In the new version, audio is integrated directly into the workflow — text prompts allow you to generate background music or sound effects at 48 kHz, add multilayer tracks up to ten seconds long, and use time commands such as “0–2 s wind.” This eliminates the need for external sound libraries and speeds up the editing process.

Vidu Q1 also offers improved anime generation — with sharper lines and more stable frame blending, based on the image integrity preservation method first introduced in Vidu 1.5. According to internal VBench tests, the model outperforms Runway Gen-2, OpenAI Sora, and Luma Dream Machine in prompt accuracy and frame consistency.

Read also

vidu
ShengShu Technology unveils the updated Vidu 2.0 platform for video generation

One of the first companies to test Vidu Q1 was Aura Productions, which reported a several-fold reduction in post-production costs for a fifty-episode anime series. The model combines instant image transitions, fast rendering, advanced anime creation, and multilayer audio, giving small teams and bloggers access to cinematic processing capabilities without the need for visual effects or sound specialists.

ShengShu Technology, founded in Singapore in 2023, specializes in multimodal large language models. After opening the Vidu platform to commercial users in July 2024, the company already serves creators in over 200 regions and actively collaborates with film studios, advertising agencies, and social media to implement new Q1 features.

TAGGED:ShengShu TechnologyVidu
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

Kling AI Image
Cheaper, More Stable, Smarter: Kling AI Launches 2.5 Turbo
25.09.2025
ChatGPT model selection
ChatGPT automatically selects a stricter model in sensitive conversations
29.09.2025
Image from Adobe video
Google Nano Banana will appear in Photoshop to enhance image editing
12.09.2025
Image example
The use of Nano Banana in Gemini grows thanks to mini-figurines (+prompt)
16.09.2025
Google AI
New Opportunities for Audio and Languages in Gemini by Google
09.09.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe for AI news, tips, and guides to ignite creativity and enhance productivity.

By subscribing, you accept our Privacy Policy and Terms of Use.

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Subscribe
Level Up with AI!
Get inspired with impactful news, smart tips and creative guides delivered directly to your inbox.

By subscribing, you accept our Privacy Policy and Terms of Use.

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?