Loading video...

Video Failed to Load

Go Home

Diffusion models generate high-quality images but require hundreds of forward passes. MIT CSAIL and Adobe Research introduce Distribution Matching Distillation (DMD), a distillation approach that converts costly multi-step diffusion models into fast one-step generators. A thread 🧵

34,347 views • 2 years ago •via X (Twitter)

9 Comments

MIT CSAIL's profile picture
MIT CSAIL2 years ago

DMD trains a one-step generator that maps random noise into realistic images, consisting of two key components. First up: it uses a regression loss to anchor the mapping process, ensuring a coarse organization of the image space, enhancing the stability of the training phase.

MIT CSAIL's profile picture
MIT CSAIL2 years ago

Additionally, it employs a distribution matching loss to guarantee that the likelihood of generating a specific image w/the student model aligns w/its actual frequency of occurrence in the real world.

MIT CSAIL's profile picture
MIT CSAIL2 years ago

The gradient of this loss is formulated as the difference between two diffusion models’ output, trained on real and fake samples respectively.

MIT CSAIL's profile picture
MIT CSAIL2 years ago

DMD achieves a strong 11.49 FID on zero-shot COCO-30K, comparable to Stable Diffusion v1.5 while being 30X faster. Compared to previous approaches, it notably balances image quality with sample diversity.

MIT CSAIL's profile picture
MIT CSAIL2 years ago

DMD paves the way for real-time visual generation. This same approach could improve diffusion-based generative models across various fields, from design, to scientific discovery and beyond, by significantly enhancing speed and effectiveness.

MIT CSAIL's profile picture
MIT CSAIL2 years ago

Paper: Authors: @TianweiY, @m_gharbi, @rzhang88, @elishechtman, @fredodurand, Bill Freeman, and Taesung Park. Project page: MIT News:

menguzat's profile picture
menguzat2 years ago

@AdobeResearch will you release the code / model for this?

Prashant's profile picture
Prashant2 years ago

@AdobeResearch Could this approach of distribution matching loss be applied to other generative AI tasks besides image generation? For example, text generation or music composition?

𝗦𝗼𝘂𝗹𝘀𝗳𝗲𝗻𝗴 𝗡𝗲𝘄 𝗬𝗼𝗿𝗸's profile picture
𝗦𝗼𝘂𝗹𝘀𝗳𝗲𝗻𝗴 𝗡𝗲𝘄 𝗬𝗼𝗿𝗸2 years ago

@rzhang88 @AdobeResearch Good for you my friend, we are try use your model colorization(which is 4 years ago) for sneakers now, lol, thank you for everything.

Related Videos