By using this website, you agree to our Privacy Policy and Terms of Use.
Accept
Craftium.AICraftium.AICraftium.AI
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Font ResizerAa
Craftium.AICraftium.AI
Font ResizerAa
Пошук
  • Home
  • News
  • Catalog
  • Collections
  • Blog
Follow US
  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback
© 2024-2025 Craftium.AI.

ShengShu Technology presented the Vidu Q1 model

The new product enables video generation from two images and text with instantly integrated multilayer audio and improved anime.

Eleni Karasidi
Eleni Karasidi
Published: 26.04.2025
News
Vidu Q1
Frame from the presentation video
SHARE

On April 21, ShengShu Technology presented Vidu Q1 — a browser-based AI model that allows users to create five-second 1080p videos from two images and a text description. Thanks to the “First-to-Last Frame” approach, movements in the clip remain consistent even if the source images are unrelated, opening up new possibilities for independent editing with smooth scene transitions.

In the new version, audio is integrated directly into the workflow — text prompts allow you to generate background music or sound effects at 48 kHz, add multilayer tracks up to ten seconds long, and use time commands such as “0–2 s wind.” This eliminates the need for external sound libraries and speeds up the editing process.

Vidu Q1 also offers improved anime generation — with sharper lines and more stable frame blending, based on the image integrity preservation method first introduced in Vidu 1.5. According to internal VBench tests, the model outperforms Runway Gen-2, OpenAI Sora, and Luma Dream Machine in prompt accuracy and frame consistency.

Read also

vidu
ShengShu Technology unveils the updated Vidu 2.0 platform for video generation

One of the first companies to test Vidu Q1 was Aura Productions, which reported a several-fold reduction in post-production costs for a fifty-episode anime series. The model combines instant image transitions, fast rendering, advanced anime creation, and multilayer audio, giving small teams and bloggers access to cinematic processing capabilities without the need for visual effects or sound specialists.

ShengShu Technology, founded in Singapore in 2023, specializes in multimodal large language models. After opening the Vidu platform to commercial users in July 2024, the company already serves creators in over 200 regions and actively collaborates with film studios, advertising agencies, and social media to implement new Q1 features.

TAGGED:ShengShu TechnologyVidu
Leave a Comment

Leave a Reply Cancel reply

Follow us

XFollow
YoutubeSubscribe
TelegramFollow
MediumFollow

Popular News

SB1
Soundboard SB1 by ElevenLabs — Creating Music and Effects on the Fly
18.05.2025
Codex
New Codex Agent from OpenAI Expands ChatGPT Capabilities
16.05.2025
Google Beam
3D Video Meetings Become Reality with Google Beam
21.05.2025
Flow
Creating Videos in Minutes: Google Launches Flow
21.05.2025
AI jungle explorers
OpenAI launches a search for the lost cities of the Amazon with prizes up to $250,000
20.05.2025

Craftium AI is a team that closely follows the development of generative AI, applies it in their creative work, and eagerly shares their own discoveries.

Navigation

  • News
  • Reviews
  • Collections
  • Blog

Useful

  • Terms of Use
  • Privacy Policy
  • Copyright
  • Feedback

Subscribe to our weekly digest of news, guides, and reviews about AI. Get fresh content delivered straight to your inbox!

Craftium.AICraftium.AI
Follow US
© 2024-2025 Craftium.AI
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?