Google presents CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

61,949 views • 1 year ago • via X (Twitter)

9 Comments

AK • 1 year ago

discuss:

Rundi Wu • 1 year ago

Thanks for sharing our work! Project page: arXiv:

BensenHsu • 1 year ago

The paper presents CAT4D (Create Anything in 4D), a method that generates high-quality dynamic 3D scenes from a single monocular video. The key idea is to use a multi-view video diffusion model, trained on a diverse combination of datasets, to synthesize novel views at any specified camera poses and timestamps. The authors evaluate the method on novel view synthesis, sparse-view static 3D reconstruction in the presence of scene motion, and 4D reconstruction from monocular videos. They report that it generates high-quality dynamic 3D scenes and outperforms existing state-of-the-art models that depend on multiple priors and external sources of information. Full paper:

HistoricTechOmar Samir • 1 year ago

CAT4D? More like create anything in 4D and amaze me!

Daveheardt • 1 year ago

4D? Like 4 dimensions? If so - this is not it, this is 3D.

plugbrain • 1 year ago

Any chance of a code release?

Zero Vertex • 1 year ago

I wish my cat could bake like that. jk I don't have a cat

Fleeber • 1 year ago

oooo

RinGo_3.0 • 1 year ago

👀
