
Ziyi Wu
@Dazitu_616 • 1,241 subscribers
Incoming RS @GoogleDeepMind working on Genie, CS PhD @UofT | Prev intern at @GoogleDeepMind, @Snap, Undergrad @Tsinghua_Uni

*Why panorama?* Standard video models struggle with object permanence: if a camera pans away and comes back, objects may disappear. With panoramas, the model is forced to generate everything in the scene. The panorama then serves as a "working memory" for consistent world generation. (3/N)
Ziyi Wu • 21,992 views • 4 months ago

📢 Introducing DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models. Compared to vanilla DPO, we improve paired data construction and preference label granularity, leading to better visual quality and motion strength with only 1/3 of the data. 🧵
Ziyi Wu • 35,374 views • 1 year ago