Video wird geladen...
Video konnte nicht geladen werden
New paper introduces NaVILA, a vision-language-action (VLA) model that integrates high-level visual-language understanding and low-level locomotion control. It enables humanoid or quadruped robots to navigate unseen environments with natural language instructions.
20,458 Aufrufe • vor 1 Jahr •via X (Twitter)
4 Kommentare

The Humanoid Hubvor 1 Jahr
Detailed insights in this thread:

AssemblyAIvor 1 Jahr
Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

AI Expert Khalidvor 1 Jahr
I'm fascinated by the potential of NaVILA to revolutionize robot navigation! Imagine instructing robots with just natural language. Pure adrenaline!

maruvor 1 Jahr
It sounds like
