Google’s Veo 2 Just SHOCKED Everyone! (OpenAI Sora Beaten)

Nitika Sharma Last Updated : 18 Dec, 2024

Google has answered recent video generation releases such as OpenAI's Sora and Amazon's Nova Reel with Veo 2, and competition in this space looks set to intensify in the coming months. Early demonstrations and benchmarks suggest that Veo 2 may set a new standard for quality, realism, and prompt adherence in AI-generated video content. Let's take a closer look at Google Veo 2 and its capabilities.

What is Veo 2?

Veo 2 is Google DeepMind’s latest AI video generation model, designed to produce high-quality, realistic, and dynamic videos based on detailed prompts. Positioned as a strong competitor to other leading AI video models like OpenAI’s Sora and Meta’s MovieGen, Veo 2 excels in adhering to complex instructions, simulating real-world physics, and capturing a wide range of cinematic effects.

Key Features

  • Accurately interprets nuanced prompts, delivering a wide range of cinematic effects—from time lapses to sweeping aerial shots.
  • Combines textual and visual cues to generate videos that closely align with user intent.
  • Provides tools for directing shot composition, camera angles, and pacing, offering a cinematic level of detail.
  • Maintains coherence throughout each video, ensuring fluid storytelling and a polished final product.

Benchmark Performance and Prompt Adherence

To compare AI video models on equal footing, Meta's research team introduced MovieGen Bench, a benchmark in which each model generates videos from the same set of prompts. Human judges then rate these outputs on overall preference and on how closely they follow the instructions.

In the head-to-head comparisons Google reports, Veo 2 is preferred over competitors such as OpenAI's Sora Turbo, Kling AI, and Meta's MovieGen. Veo 2 scores well not only on quality and viewer preference but also on prompt adherence. Whether asked to produce a drifting car scene in a bustling cityscape or an intense close-up portrait, Veo 2 tends to match the user's request, which sets it apart from models that often stray from the original prompt.

Source: Veo 2
  • Extensive Evaluation: 1,003 prompts were tested on MovieGen Bench, a dataset released by Meta.
  • Top Performance: Veo 2 achieved the highest scores in both overall preference and prompt accuracy.
  • Consistent Benchmarking: All models were evaluated at 720p resolution to ensure a fair comparison.
  • Sample Durations: Veo 2’s clips were 8 seconds long, VideoGen’s ran for 10 seconds, and other models produced 5-second clips.
  • Full-Length Display: All videos were shown to raters in their entirety, so each clip was judged on its full duration.
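
To make the idea of a head-to-head human evaluation concrete, here is a minimal Python sketch of how pairwise rater verdicts could be turned into preference rates. It is an illustration only: the function name `preference_rates`, the verdict labels, and the sample votes are all hypothetical, and this is not the scoring procedure used by Google or Meta.

```python
from collections import Counter

# Hypothetical verdict labels for one pairwise comparison:
# "a"   -> rater preferred model A's video
# "b"   -> rater preferred model B's video
# "tie" -> rater had no preference
LABELS = ("a", "b", "tie")


def preference_rates(votes):
    """Return the fraction of votes for each label in a list of verdicts."""
    if not votes:
        raise ValueError("at least one vote is required")
    unknown = set(votes) - set(LABELS)
    if unknown:
        raise ValueError(f"unknown verdict labels: {sorted(unknown)}")
    counts = Counter(votes)
    total = len(votes)
    return {label: counts[label] / total for label in LABELS}


# Made-up votes for illustration; these are not real benchmark data.
sample_votes = ["a", "a", "b", "tie"]
rates = preference_rates(sample_votes)
print(rates)
```

Keeping ties as their own category, rather than splitting them between the two models, makes it clear how often raters had no preference at all.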

Veo 2 vs Sora

Let’s compare videos generated by Veo 2 and Sora, side-by-side:

Prompt 1

A low-angle shot captures a flock of pink flamingos 
gracefully wading in a lush, tranquil lagoon.

The vibrant pink of their plumage
contrasts beautifully with the verdant green
of the surrounding vegetation
and the crystal-clear turquoise water.

Sunlight glints off the water's surface,
creating shimmering reflections
that dance on the flamingos' feathers.

The birds' elegant, curved necks
are submerged as they walk through the shallow water,
their movements creating gentle ripples
that spread across the lagoon.

The composition emphasizes the serenity
and natural beauty of the scene,
highlighting the delicate balance of the ecosystem
and the inherent grace of these magnificent birds.

The soft, diffused light of early morning
bathes the entire scene
in a warm, ethereal glow.

Veo 2 Output: 

Sora Output: 

Prompt 2

A cinematic shot captures a fluffy Cockapoo,
perched atop a vibrant pink flamingo float,
in a sun-drenched Los Angeles swimming pool.

The crystal-clear water sparkles under
the bright California sun,
reflecting the playful scene.

The Cockapoo's fur,
a soft blend of white and apricot,
is highlighted by the golden sunlight,
its floppy ears gently swaying in the breeze.

Its happy expression and wagging tail
convey pure joy and summer bliss.

The vibrant pink flamingo
adds a whimsical touch,
creating a picture-perfect image
of carefree fun in the LA sunshine.

Veo 2 Output: 

Sora Output: 

Prompt 3

A cinematic, high-action tracking shot
follows an incredibly cute dachshund
wearing swimming goggles
as it leaps into a crystal-clear pool.

The camera plunges underwater with the dog,
capturing the joyful moment of submersion
and the ensuing flurry of paddling
with adorable little paws.

Sunlight filters through the water,
illuminating the dachshund's sleek, wet fur
and highlighting the determined expression
on its face.

The shot is filled with the vibrant blues and greens
of the pool water,
creating a dynamic and visually stunning sequence
that captures the pure joy and energy
of the swimming dachshund.

Veo 2 Output: 

Sora Output: 

Observation

What immediately stands out about Veo 2 is its striking realism. Across these three prompts, from the close-up shots to the finer details each prompt asks for, Veo 2 does a better job than Sora.

How to Access Veo 2? 

  1. Sign Up for the Waitlist: Veo 2 isn’t publicly available to everyone just yet. Start by joining the waitlist, which puts you in line for access once it’s granted. (sign up here)
  2. Watch for Email Updates: Keep an eye on your inbox. When your access is approved, you’ll receive a notification email with instructions.
  3. Get Started: Once you have access, using Veo 2 is straightforward. Simply enter your prompts and begin generating your own AI-driven video content.

Conclusion

Google’s Veo 2 represents a significant leap forward in AI-driven video generation and, in the comparisons above, outshines its competitors. While it’s not perfect, its improvements in prompt adherence, physics simulation, and image fidelity suggest a bright future. As AI video technology continues to advance, Veo 2 stands as a prime example of how far we’ve come, and how much potential remains on the horizon.


Hello, I am Nitika, a tech-savvy Content Creator and Marketer. Creativity and learning new things come naturally to me. I have expertise in creating result-driven content strategies. I am well versed in SEO Management, Keyword Operations, Web Content Writing, Communication, Content Strategy, Editing, and Writing.
