正在加载视频...
视频加载失败
Excited to share "MultiDiffusion"! A controlled image generation framework w/ pre-trained text-to-image diffusion model. * Spatial guidance controls (bounding boxes/masks) * Arbitrary aspect ratios (huge Panoramas!) NO training NO finetuning. [1/3]Lior Yariv Yaron Lipman Tali Dekel
88,845 次观看 • 3 年前 •via X (Twitter)
10 条评论

Our key idea is to define a new generation process, based on an optimization task that binds together multiple diffusion paths. The optimal solution is given in closed-form, and can be found analytically, without a computational overhead. [2/3]

Visit our project webpage for more details, results, and code 🥳 Arxiv: [3/3]

MultiDiffusion is now integrated into diffusers 🚀 currently text2panorama is supported, spatial controls (masks/bounding boxes)- soon :) demo: official repo: Thanks @RisingSayak @_akhaliq and @huggingface team!

@YarivLior @lipmanya @talidekel Very cool work! Congrats @omerbartal 🎊

@YarivLior @lipmanya @talidekel Thanks @hila_chefer :)

@YarivLior @lipmanya @talidekel Super cool work @omerbartal!

@YarivLior @lipmanya @talidekel Super cool, and nice demo! I think you have a typo in the gif: a tree trunk, not a tree truck, though the latter would also be fun to see =)

@YarivLior @lipmanya @talidekel Thanks! Ohh definitely a typo, but a cool idea to try ;)

@YarivLior @lipmanya @talidekel Nice background trick! I think I've the merging of predictions before though but not so nicely mathematically motivated. I think there's a PR to diffusers upscaling x4 that does something similar for example

@YarivLior @lipmanya @talidekel Here's the paper I was thinking about but I may have misunderstood the math 🙏

