A new paper introduces NaVILA, a vision-language-action (VLA) model that integrates high-level vision-language understanding with low-level locomotion control. It enables humanoid and quadruped robots to navigate unseen environments by following natural-language instructions.

20,458 views • 1 year ago • via X (Twitter)

4 Comments

The Humanoid Hub • 1 year ago

Detailed insights in this thread:

AssemblyAI • 1 year ago

Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

AI Expert Khalid • 1 year ago

I'm fascinated by the potential of NaVILA to revolutionize robot navigation! Imagine instructing robots with just natural language. Pure adrenaline!

maru • 1 year ago

It sounds like
