Veo is capable of generating “high quality” 1080p resolution videos in a range of different visual and cinematic styles from text or image-based prompts. When the model was first announced, these generated clips could be vaguely “longer than a minute” in length, but Google doesn’t specify any length restrictions for the preview version. Some new sample clips in Google’s announcement are on par with what we’ve already seen from Veo – without a careful eye, it’s extremely hard to tell that the videos are generated by AI.
The latest version of Google’s Imagen 3 text-to-image generator will also be available to all Google Cloud customers via Vertex “starting next week”, expanding its initial US release on Google’s AI Test Kitchen in August. Users on Google’s green list can also access new features like prompt-based photo editing and the ability to “infuse your own brand, style, logo, subject or product features” in the generated images.
Google says Veo and Imagen 3 have built-in protections to prevent them from generating harmful content or violating copyright protections – although we found the latter to be not difficult to circumvent. Everything produced by Veo and Imagen 3 is also integrated with DeepMind’s SynthID technology, a kind of invisible digital watermark that Google says can “decrease issues of misinformation and misattribution.” This is a similar concept to Adobe’s Content Credentials system, which can be integrated into content produced by the creative software giant’s own image and video generating AI models.
With Google’s video model now widespread, OpenAI is significantly behind its competitors and running out of time to deliver on its promise to release Sora by the end of 2024. We’re already seeing AI-generated content appearing in ads like Coca-Cola’s recent Christmas campaignand businesses are encouraged not to wait for Sora: according to Google, 86% of organizations already using generative AI see their revenue increase.