A new paper introduces NaVILA, a vision-language-action (VLA) model that integrates high-level vision-language understanding with low-level locomotion control. It enables humanoid or quadruped robots to navigate unseen environments from natural-language instructions.
20,458 views • 1 year ago • via X (Twitter)
4 Comments

The Humanoid Hub • 1 year ago
Detailed insights in this thread:

AssemblyAI • 1 year ago
Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

AI Expert Khalid • 1 year ago
I'm fascinated by the potential of NaVILA to revolutionize robot navigation! Imagine instructing robots with just natural language. Pure adrenaline!

maru • 1 year ago
It sounds like
