creative AImodel releasesmultimodal AIresearch and science

Genie conjures up new worlds

Google DeepMind’s Genie 3 generates interactive, navigable 3D worlds from text, advancing video generation into controllable simulation for agents and creators.

ExoBrain

09 August 20251 min read

This image shows nine snapshots from Genie 3, Google DeepMind’s new world model. Each scene is an interactive environment, generated from text, navigable in real time. Worlds run at 720p and 24 frames per second, staying coherent for minutes with about a minute of visual memory. You can steer with keys and trigger ‘promptable events’ such as weather shifts or new objects. Compared with Genie 2, the quality and length of interactions are significantly extended. We’re seeing ground-breaking progress with video generation evolving into controllable simulation, opening faster training for agents in synthetic worlds and prototyping for creators and game designers.

Subscribe to the ExoBrain Weekly Newsletter

Stay up to date with AI. Get analysis of the week's most important stories, plus a focused roundup across business, governance, research and infrastructure.

Genie conjures up new worlds

Visual thinking points to the next wave

Gemini 3 leaves competitors scrambling

Wordsmiths in the dark

Photo editing goes bananas

Subscribe to the ExoBrain Weekly Newsletter