
Google's new Veo 2 and Imagen 3 models advance video and image generation, bringing cinematic-quality visuals and new tools like Whisk to creators.
Dec 16, 2024
Google has just rolled out significant updates to its video and image generation models with the release of Veo 2 and Imagen 3. These new versions are now integrated into Google Labs tools like VideoFX and ImageFX, along with a novel experiment called Whisk. Let's dive into what’s changed and why it matters for practitioners.
Veo 2 is the latest iteration of Google’s video generation model, designed to produce high-quality videos that look more realistic and follow cinematographic conventions more closely. Here’s a breakdown of the key improvements:
Imagen 3 is the updated version of Google’s image generation model. It brings several enhancements to the table:
Whisk is an experimental Google Labs tool that lets users prompt with images instead of long text descriptions, supplying separate images for subject, scene, and style. It builds on Imagen 3 and introduces a few unique features:

To achieve these improvements, Google has made several technical advancements:
Architecture Enhancements:
Training Data:
Performance Metrics:
For developers and researchers working with generative models, these updates offer several benefits:
Google’s release of Veo 2 and Imagen 3 marks a significant step forward in generative AI. Beyond the gains in output quality, the models ship inside practical tools for creators and researchers. Whether you’re looking to enhance your video production or explore new avenues in image generation, these updates are worth checking out.
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.