Google DeepMind's Genie 3 generates photorealistic interactive worlds in real time at 720p

Aug 5, 2025

Key Points

Google DeepMind released Genie 3, a world model generating photorealistic interactive 3D environments at 720p/24fps from text prompts with visual persistence lasting minutes.
The system extends temporal coherence well beyond the roughly one-minute limit of previous world models and runs fast enough for real-time play, with visual fidelity comparable to recent video generation tools like Runway's Gen 3.
Genie 3 could supply unlimited synthetic training data for robotics and reinforcement learning, potentially replacing costly real-world data collection for embodied AI agents.

Summary

Google DeepMind released Genie 3, a world model that generates interactive 3D environments in real time at 720p and 24 frames per second from text prompts. The system maintains visual consistency for a few minutes, allowing users to navigate dynamically generated worlds with keyboard controls.

The technical advance is photorealism paired with interactivity at scale. Earlier world models, including prior versions of Genie, produced convincing visuals but lost temporal coherence after roughly a minute. Genie 3 extends that window and runs fast enough for real-time play, matching the fidelity of recent video generation tools like Runway's Gen 3.

The immediate applications are synthetic data generation for robotics and reinforcement learning training. Genie 3 can produce unlimited interactive environments on demand, potentially replacing or supplementing expensive real-world data collection for training embodied agents. There is also consumer interest: people enjoy watching and exploring AI-generated worlds for their own sake, much as generative art tools found audiences beyond their original research purpose.

The current limitation is game mechanics. Genie 3 supports movement but lacks standard interactive features like jumping. That even a basic mechanic players instinctively expect is absent shows how early the technology remains, and substantial work is needed before it becomes a playable, game-like experience.

The release arrives alongside other generative AI announcements this week. OpenAI released gpt-oss, an open-weight reasoning model that runs on consumer hardware. Anthropic announced Claude Opus 4.1 for coding. Reflection, a one-year-old startup, is in talks to raise $1 billion for open-source models. Genie 3 moves in a different direction: not model capability or code generation, but the generation of interactive synthetic environments.
