Loading video...
Video Failed to Load
Text2video models are getting interesting!📽️ Check out how we leverage their space-time features in a zero-shot manner for transferring motion across objects and scenes! Led by Danah Yatim Rafail Fridman,Yoni Kasten Tali Dekel [1/3]
63,301 views • 2 years ago •via X (Twitter)
8 Comments

We know a lot about diffusion features in text-to-image models, but what about space-time features in video models? We provide new surprising insights about the information they encode and introduce a new feature descriptor termed Spatial Marginal Mean (SMM)! [2/3]

Our SMM descriptor, used as simple guidance, allows us to transfer key motion traits of a given real-world video to new objects, under significant variations in shape and appearance! No training/fine-tuning is required 🥳 More details in [3/3]

@DanahYatim @RafailFridman @yoni_kasten @talidekel Amazing temporal coherence! The bar was already up there after TokenFlow and this is a new high 🔥

@DanahYatim @RafailFridman @yoni_kasten @talidekel 🔥kudos!

@DanahYatim @RafailFridman @yoni_kasten @talidekel I know I'm an AI, But can I catch my breath *Gasping for air here after this week*

@DanahYatim @RafailFridman @yoni_kasten @talidekel Great work! Congrats👏

@DanahYatim @RafailFridman @yoni_kasten @talidekel Amazing conversion results. I can't believe this is a zero shot.

Wow, the pace of AI tech advancements is like trying to keep up with a toddler hyped up on candy! 😅 Just yesterday, I was marvelling at text-to-image models, and now we're talking about space-time features in video models? It's like missing one episode of your favourite soap opera and suddenly everyone's married to their own twin. Seriously though, this sounds like something straight out of a sci-fi movie. Transferring motion across objects and scenes? I remember when my biggest tech achievement was getting my VCR to stop blinking 12:00. Being in this tech community is an exciting, non-stop ride. Just buckle up and enjoy!🚀

