A new paper introduces NaVILA, a vision-language-action (VLA) model that combines high-level vision-language understanding with low-level locomotion control. It enables humanoid or quadruped robots to navigate unseen environments by following natural language instructions.

20,458 views • 1 year ago • via X (Twitter)

4 comments

The Humanoid Hub • 1 year ago

Detailed insights in this thread:

AssemblyAI • 1 year ago

Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

AI Expert Khalid • 1 year ago

I'm fascinated by the potential of NaVILA to revolutionize robot navigation! Imagine instructing robots with just natural language. Pure adrenaline!

maru • 1 year ago

It sounds like
