正在加载视频...

视频加载失败

Announcing Diffusion Forcing Transformer (DFoT), our new video diffusion algorithm that generates ultra-long videos of 800+ frames. DFoT enables History Guidance, a simple add-on to any existing video diffusion models for a quality boost. Website: (1/7)

175,996 次观看 • 1 年前 •via X (Twitter)

8 条评论

Boyuan Chen 的头像
Boyuan Chen1 年前

Classifier-free Guidance (CFG) has been widely used by video diffusion models to boost sample quality. However, researchers rarely perform CFG beyond the first frame. Our paper finds that an equally important conditioning variable, the history, is the long-ignored key. (2/7)

Boyuan Chen 的头像
Boyuan Chen1 年前

Can we train a single model to perform conditional diffusion with different portions of history - variable lengths, subsets of frames, and even different image-domain frequencies? Introducing DFoT, a simple yet flexible add-on that requires no architectural changes. (3/7)

Boyuan Chen 的头像
Boyuan Chen1 年前

Unlike previous methods, DFoT views history or target alike as tokens of different noise levels. DFoT trains diffusion with varying noise levels per frame. To conditionally sample, one simply masks out a portion of history with noise before computing the diffusion score. (4/7)

Boyuan Chen 的头像
Boyuan Chen1 年前

DFoT enables History Guidance (HG), a family of history-conditioned guidance methods that composes diffusion scores from different histories. From its simplest form to its most advanced variant, HG significantly enhances video diffusion and unlocks new abilities. (5/7)

Boyuan Chen 的头像
Boyuan Chen1 年前

We show that DFoT alone is already a competitive model, matching or beating industry SOTA with way more compute than us. Together with HG, it can stably rollout very long videos, stay robust to out-of-distribution context, and stitch sub-trajectories (6/7)

Boyuan Chen 的头像
Boyuan Chen1 年前

For more information, please visit our paper and project website and. Shout out to my awesome collaborators @kiwhansong0, @du_yilun, @max_simchowitz, @RussTedrake and @vincesitzmann (7/7)

Alex // Viral Growth for Apps 的头像
Alex // Viral Growth for Apps1 年前

Here's a simple strategy to churn out 100s of viral videos (we’ve done 40 Million views in the past 3 weeks using this exact framework) 🧵

Amir Arsalan Soltani 的头像
Amir Arsalan Soltani1 年前

Great!

相关视频