Building better 👁️ and 🧠 for 🤖 @Amazon
Prev @Meta @Columbia @CERN
How can a visuomotor policy learn from internet videos? We introduce Dreamitate, where a robot uses a fine-tuned video diffusion model to dream the future (top) and imitate the dream to accomplish a task (bottom). website: paper: