正在加载视频...

视频加载失败

Introducing Neural Jacobian Fields, robot 3D kinematic models learned only from vision! They can model & control robots from just a single RGB camera, even those w/ intractable kinematics & no embedded sensors such as soft, 3D-printed pneumatic hands! 1/n

54,023 次观看 • 1 年前 •via X (Twitter)

14 条评论

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

This project was led by the outstanding @sizhe_lester_li, in a multi-disciplinary collaboration with @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus! Amazing team - truly grateful for the soft robot chops hands-on contributed by @annan__zhang & co! 2/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

While we humans constantly use vision for closed-loop control of our hands and limbs, today’s robots instead rely completely on embedded sensors, precision manufacturing and expert-designed forward kinematic models to understand where they are in 3D: 3/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Conventional robots are “blind” when it comes to state estimation! This constrains their design to be “simple” such that an expert can design a model & requires expensive manufacturing & sensors. But bio-inspired, multi-material, and cheap robots cannot be modeled easily! 4/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Our approach enables this pneumatic bio-inspired hand to perform physical tasks and allows the $220 poppy arm to draw “MIT” in the air, all with using a single RGB camera as the only sensor! None of these motion trajectories are prescribed as part of our training data. 5/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Neural Jacobian Fields enable us to equip any robot with vision-based control, irrespective of its sensors, fabrication, material, or actuation. Each point is mapped to its “system Jacobian”, which maps a change in motor commands to the 3D motion of that point! 6/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

The Jacobian Field can be directly used for inverse dynamics control. Given desired motions, our model solves for the corresponding control command at interactive speeds. 7/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Neural Jacobian Fields are trained completely self-supervised, from multi-view videos of a robot executing random commands - no human labels or intervention. At test time, a *single* image suffices to reconstruct them, for closed-loop control from a single RGB camera. 8/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

The learned Jacobian Field is interpretable: we can visualize it by color-coding sensitivity to the different control channels. In this way, we can see that the Jacobian field correctly discovers the 3D kinematics of the robot. 9/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

We’re not advocating for *only* using vision. In this paper, that’s what we do, to show how *surprisingly powerful* it can be! But: the more sensors, the better. It is straightforward to add more sensors - just add them to the input to reconstruct the jacobian field! 10/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Neural Jacobian Fields could make robotic automation more affordable by enabling control of cheap robots, as they allow us to skip expensive sensors, enable new fabrication techniques such as 3D printing, and are robust to imprecise manufacturing! 11/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

There are still ways to go - but we think that this direction is super promising and just the beginning of an exciting arc of building representations of robotic embodiments that can bridge vision, proprioception, touch, and more! 12/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Again, shoutout to the amazing team at MIT - make sure to check them out! @sizhe_lester_li, @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus! Code & everything is already released: 13/n

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Also, I should add we are aware of another paper that is named "Neural Jacobian Fields": It's a completely different idea and problem and not directly related, but still really, really cool (check it out!)...

Vincent Sitzmann 的头像
Vincent Sitzmann1 年前

Our paper is actually called "Unifying 3D Representation and Control of Diverse Robots with a Single Camera". We thought about not naming the core method "Neural Jacobian Fields", but it makes the most sense, b/c we are really modeling the "System Jacobian", a staple in robotics.

相关视频