#depthanything

We are excited to be among the very first groups selected by NVIDIA Robotics to test the new NVIDIA #Thor. We have managed to run a #VisionLanguageModel (Qwen 2.5 VL) for semantic understanding of the environment, along with a monocular depth model (#DepthAnything v2), for safe autonomous navigation, all onboard. No cloud, no internet connection required! The video shows a simple result obtained in just two weeks of work. Kudos to Leonard Bauersfeld Jiaxu Xing Ismail Geles Yannick Armati for making this possible! #ComputerVision #Robotics University of Zurich UZH Space Hub UZH IfI European Research Council (ERC)
Davide Scaramuzza31,433 次观看 • 10 个月前
没有更多内容可加载
