Google has added a new feature to the Gemini app that allows uploading multiple reference images for a single video request. Now users can combine these images with a text description to create videos and audio that more accurately match their ideas. This provides more control over the appearance and sound of the final clip.
Previously, Google tested this feature on the Flow platform. In Flow, you can also extend existing clips, merge multiple scenes, and use a larger quota for video creation compared to the Gemini app.
The app uses the Veo 3.1 model, available since mid-October. According to Google, this version of the model creates more realistic textures, better reproduces image details, and provides higher quality audio than the previous Veo 3.0.
The Gemini update makes it easier to create videos with specific user preferences in mind. With support for multiple images and text descriptions, users can experiment with different styles and plots to achieve the desired result.

