Google DeepMind debuts realtime, photorealistic 3D world builder Genie 3

Genie 3 is a real time rendered of realistic 3D worlds - and a step toward AGI.
In Genie 3, the world’s your oyster — and you walk around in it, or mash things up. (Picture: Google DeepMind)
The model creates deeply realistic 3D worlds that you can interact with to «achieve goals,» improve education or just have a fun old time.

It builds upon its own previous version, which could only support video in 10-20 seconds at 360p, and on Veo 3 — which makes photorealistic non-interactive videos.

The result is a model that can make 720p video at a smooth 24 fps, understands physics, remembers about a minute back of previous renders for consistency, and creates a 3D world that you can move around in «for a few minutes.»

Fully interactive, on-the-fly rendering
Like Veo 3, all you need to get going is a simple text prompt, but the video is rendered on the fly, so you can also add elements or have things occur in the video while you are going through a simulation.

The obvious implications here is that you could, for example, explore through a walk down the streets of ancient Greece, or experience life like a dinosaur — not to mention what this tech could do for video games, rendering interactive photorealism in real time.

But is also has a specific use in the training of AI agents and artificial general intelligence, sometimes called superintelligence, writes TechCrunch.

Candidate for robot training
Software agents need to interact with the world at some stage, and to do so it will need training.

The Genie 3 model lets AI agents do just that; they can engage with the worlds it is creating and say «walk to the shed» in a garden rendering and learn from that experience.

To get to that point, the world models need to get much more detailed and last more than just a few minutes, but Google Deepmind sees Genie 3 as an important step in that direction.

Not widely available for a while yet
The model is currently in preview mode and is only available to «select creators and academics.»

The creators part is how we get to the point where the Internet likely will fill up with videos from the model. It’s already happening a lot at r/singularity today.

Google says they are working on a slightly wider release to «trusted testers» some time in the future, and there is no timeline for when this could become generally available.

Read more: Google DeepMind’s Genie 3 launch page, X.com thread, writeup on TechCrunch, adds some quotes.