Google has announced the release of a new generative AI model called Gemini 2.0 Flash. This model can generate images, audio, and text, as well as use third-party apps and services. It can also execute code, interact with Google Search, and provide answers to queries related to photos, videos, and audio recordings.
Gemini 2.0 Flash will be available to developers via the Gemini API and the Google AI Studio and Vertex AI platforms. However, audio and image generation capabilities will initially be available only to “early access partners.”
The new model replaces the previous version, Gemini 1.5 Pro, thanks to improved code processing and image analysis capabilities. “We know that Flash is extremely popular among developers for its balance of speed and performance,” said Tulsee Doshi, product lead for the Gemini model at Google.
To ensure safety, Google uses SynthID watermarking technology on all audio and images created with 2.0 Flash.