Loading video...

Video Failed to Load

Go Home

Google presents Genie Generative Interactive Environments introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie...

684,259 views • 2 years ago •via X (Twitter)

9 Comments

AK's profile picture
AK2 years ago

paper page:

AK's profile picture
AK2 years ago

project page:

XR Multiverse's profile picture
XR Multiverse2 years ago

Google's warehouse of unused tech

crispyshh's profile picture
crispyshh2 years ago

@apples_jimmy Text to Mario achieved externally

meowbooks --🩸/acc's profile picture
meowbooks --🩸/acc2 years ago

that's so cool

Matt Griswold's profile picture
Matt Griswold2 years ago

Can it make Wolfenstein? If not, why not.

Smoke-away's profile picture
Smoke-away2 years ago

🔥🔥🔥

Mat's profile picture
Mat2 years ago

all these papers, no public releases 🫠

Ollin Boer Bohan's profile picture
Ollin Boer Bohan2 years ago

Demo page with more videos.

Related Videos

Tencent presents GameGen-O Open-world Video Game Generation We introduce GameGen-O, the first diffusion transformer model tailored for the generation of open-world video games. This model facilitates high-quality, open-domain generation by simulating a wide array of game engine features, such as innovative characters, dynamic environments, complex actions, and diverse events. Additionally, it provides interactive controllability, thus allowing for the gameplay simulation. The development of GameGen-O involves a comprehensive data collection and processing effort from scratch. We collect and build the first Open-World Video Game Dataset (OGameData), amassed extensive data from over a hundred of next-generation open-world games, employing a proprietary data pipeline for efficient sorting, scoring, filtering, and decoupled captioning. This robust and extensive OGameData forms the foundation of our model's training process. GameGen-O undergoes a two-stage training process, consisting of foundation model pretraining and instruction tuning. In the first phase, the model is pre-trained on the OGameData via the text-to-video and video continuation, endowing GameGen-O with the capability for open-domain video game generation. In the second phase, the pre-trained model is frozen, and we fine-tuned using a trainable InstructNet, which enables the production of subsequent frames based on multimodal structural instructions. This whole training process imparts the model with the ability to generate and interactively control content. In summary, GameGen-O represents a notable initial step forward in the realm of open-world video game generation via generative models. It underscores the potential of generative models to serve as an alternative to rendering techniques, which can efficiently combine creative generation with interactive capabilities.

AK

366,858 views • 1 year ago