Introducing Marigold 🌼 - a universal monocular depth estimator, delivering incredibly sharp predictions in the wild! Based on Stable Diffusion, it is trained with synthetic depth data only and excels in zero-shot adaptation to real-world imagery. Check it out: 🌐 Website: 🤗 Hugging Face Space: 📄 Paper: 👾 Code:...

490,031 views • 2 years ago • via X (Twitter)

9 Comments

AI葵(Aoi) 🇹🇼NeRF技術型Vtuber • 2 years ago

70 s of inference for a single image is very slow.

Anton Obukhov • 2 years ago

We focus on quality first; speed will come later through research and improvements in diffusion samplers. Arguably, Stable Diffusion is also slower than most GANs :)

Bilawal Sidhu • 2 years ago

Damn those are some crispy depth maps!

Chris Offner • 2 years ago

Wow, those depth maps look amazing. Great work! There goes my course project for Deep Learning. 😅

Boyuan Chen • 2 years ago

Is this metric or relative depth?

Anton Obukhov • 2 years ago

Relative

Janek Mann • 2 years ago

I also consider this strong evidence that Stable Diffusion learned much stronger 3D scene understanding than commonly assumed.

Omar Sanseviero • 2 years ago

Very very cool!

Tom • 2 years ago

Wow. More detailed than the ground truth, very cool to see.
