Robots might learn better from video than from language! 📼 Most Vision-Language-Action (VLA) models learn what to do from text, but still struggle with how things move in the real world. That makes them data-hungry and slow to train. mimic video takes a different route. Instead of grounding robot...
49,920 views • 6 months ago • via X (Twitter)

