正在加载视频...

视频加载失败

Introducing Marigold 🌼 - a universal monocular depth estimator, delivering incredibly sharp predictions in the wild! Based on Stable Diffusion, it is trained with synthetic depth data only and excels in zero-shot adaptation to real-world imagery. Check it out: 🌐 Website: 🤗 Hugging Face Space: 📄 Paper: 👾 Code:...

489,743 次观看 • 2 年前 •via X (Twitter)

9 条评论

AI葵(Aoi) 🇹🇼NeRF技術型Vtuber 的头像
AI葵(Aoi) 🇹🇼NeRF技術型Vtuber2 年前

70s of inference for an image is very slow..

Anton Obukhov 的头像
Anton Obukhov2 年前

We focus on the quality first - speed will come later through research and improvements in diffusion samplers. Arguably, Stable Diffusion is also slower than most of the GANs :)

Bilawal Sidhu 的头像
Bilawal Sidhu2 年前

Damn those are some crispy depth maps!

Chris Offner 的头像
Chris Offner2 年前

Wow, those depth maps look amazing. Great work! There goes my course project for Deep Learning. 😅

Boyuan Chen 的头像
Boyuan Chen2 年前

Is this metric or relative depth?

Anton Obukhov 的头像
Anton Obukhov2 年前

Relative

Janek Mann 的头像
Janek Mann2 年前

I also consider this strong evidence that Stable Diffusion learned much stronger 3D scene understanding than commonly assumed.

Omar Sanseviero 的头像
Omar Sanseviero2 年前

Very very cool!

Tom 的头像
Tom2 年前

Wow. More detailed than the ground truth, very cool to see.

相关视频