正在加载视频...
视频加载失败
New paper introduces NaVILA, a vision-language-action (VLA) model that integrates high-level visual-language understanding and low-level locomotion control. It enables humanoid or quadruped robots to navigate unseen environments with natural language instructions.
4 条评论

The Humanoid Hub1 年前
Detailed insights in this thread:

AssemblyAI1 年前
Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

AI Expert Khalid1 年前
I'm fascinated by the potential of NaVILA to revolutionize robot navigation! Imagine instructing robots with just natural language. Pure adrenaline!

maru1 年前
It sounds like
