Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Introducing The Matrix --- a foundation world model for generating infinite-length, hyper-realistic videos with real-time, frame-level control: - Infinite-length video generation - 720p high-quality rendering - Real-time, frame-level control at 16 FPS - Generalization to real-world video control 🔗Blog: 📄Paper: 💻Code & Playable Demo: Coming soon! Key Innovation: A...

178,322 Aufrufe • vor 1 Jahr •via X (Twitter)

10 Kommentare

Profilbild von Hongyang Zhang
Hongyang Zhangvor 1 Jahr

We compare The Matrix with many state-of-the-art game simulators.

Profilbild von Hongyang Zhang
Hongyang Zhangvor 1 Jahr

Interestingly, pre-trained on a vast collection of internet videos combined with AAA game footage, The Matrix demonstrates impressive domain generalization. For instance, it enables scenarios like driving a BMW X3 through an office area.

Profilbild von Hongyang Zhang
Hongyang Zhangvor 1 Jahr

Here’s an example showcasing The Matrix generating an ultra-long video with precise real-time control lasting over 14 minutes (>13440 frames). For more examples, visit our project page:

Profilbild von Vaibhav (VB) Srivastav
Vaibhav (VB) Srivastavvor 1 Jahr

Amazing! Looking forward to the release! 🔥 If you do open model checkpoint release as well, then I’d be happy to help you with that from @huggingface side 🤗 My DMs are open! Let’s make this huge!

Profilbild von Hongyang Zhang
Hongyang Zhangvor 1 Jahr

@huggingface Thank you for the offer. 🤗

Profilbild von xiao sun
xiao sunvor 1 Jahr

in some sense it's a 1D generation (or fake 2D), same view won't show up twice when you look back at it again.

Profilbild von Jason Kneen
Jason Kneenvor 1 Jahr

Everything about gaming, mobile gaming and everything else is about to change. Can't wait for the code drop people are going to go crazy with this :)

Profilbild von Bobby
Bobbyvor 1 Jahr

nice! however "consistency models in real-time" need to also apply to the backgrounds, the mountains completely changing in 5 seconds after the camera turns isn't going to work. Every asset needs to remain consistent - unless the goal is an acidtrip experience for the end user.🫠

Profilbild von Hongyang Zhang
Hongyang Zhangvor 1 Jahr

is also at X, the first author of this project.

Profilbild von mcquin mcdonalds westwood skyhigh underground
mcquin mcdonalds westwood skyhigh undergroundvor 1 Jahr

@Yuchenj_UW host on hyperbolic

Ähnliche Videos