Google presents CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

61,949 views • 1 year ago • via X (Twitter)

9 comments

AK • 1 year ago

discuss:

Rundi Wu • 1 year ago

Thanks for sharing our work! Project page: arXiv:

BensenHsu • 1 year ago

The paper presents a method called CAT4D (Create Anything in 4D) that can generate high-quality dynamic 3D scenes from a single input monocular video. The key idea is to leverage a multi-view video diffusion model trained on a diverse combination of datasets to enable novel view synthesis at any specified camera poses and timestamps. The authors evaluate their method on various tasks, including novel view synthesis, sparse-view static 3D reconstruction in the presence of scene motion, and 4D reconstruction from monocular videos. They show that their method can generate high-quality dynamic 3D scenes and outperforms existing state-of-the-art models that depend on multiple priors and external sources of information. full paper:

HistoricTechOmar Samir • 1 year ago

CAT4D? More like create anything in 4D and amaze me!

Daveheardt • 1 year ago

4D? Like 4 dimensions? If so, this is not it; this is 3D.

plugbrain • 1 year ago

Any chance of a code release?

Zero Vertex • 1 year ago

I wish my cat could bake like that. jk I don't have a cat

Fleeber • 1 year ago

oooo

RinGo_3.0 • 1 year ago

👀
