Loading video...

Video Failed to Load

Go Home

Introducing Neural Jacobian Fields, robot 3D kinematic models learned only from vision! They can model & control robots from just a single RGB camera, even those w/ intractable kinematics & no embedded sensors such as soft, 3D-printed pneumatic hands! 1/n

54,023 views • 1 year ago •via X (Twitter)

14 Comments

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

This project was led by the outstanding @sizhe_lester_li, in a multi-disciplinary collaboration with @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus! Amazing team - truly grateful for the soft robot chops hands-on contributed by @annan__zhang & co! 2/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

While we humans constantly use vision for closed-loop control of our hands and limbs, today’s robots instead rely completely on embedded sensors, precision manufacturing and expert-designed forward kinematic models to understand where they are in 3D: 3/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Conventional robots are “blind” when it comes to state estimation! This constrains their design to be “simple” such that an expert can design a model & requires expensive manufacturing & sensors. But bio-inspired, multi-material, and cheap robots cannot be modeled easily! 4/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Our approach enables this pneumatic bio-inspired hand to perform physical tasks and allows the $220 poppy arm to draw “MIT” in the air, all with using a single RGB camera as the only sensor! None of these motion trajectories are prescribed as part of our training data. 5/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Neural Jacobian Fields enable us to equip any robot with vision-based control, irrespective of its sensors, fabrication, material, or actuation. Each point is mapped to its “system Jacobian”, which maps a change in motor commands to the 3D motion of that point! 6/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

The Jacobian Field can be directly used for inverse dynamics control. Given desired motions, our model solves for the corresponding control command at interactive speeds. 7/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Neural Jacobian Fields are trained completely self-supervised, from multi-view videos of a robot executing random commands - no human labels or intervention. At test time, a *single* image suffices to reconstruct them, for closed-loop control from a single RGB camera. 8/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

The learned Jacobian Field is interpretable: we can visualize it by color-coding sensitivity to the different control channels. In this way, we can see that the Jacobian field correctly discovers the 3D kinematics of the robot. 9/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

We’re not advocating for *only* using vision. In this paper, that’s what we do, to show how *surprisingly powerful* it can be! But: the more sensors, the better. It is straightforward to add more sensors - just add them to the input to reconstruct the jacobian field! 10/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Neural Jacobian Fields could make robotic automation more affordable by enabling control of cheap robots, as they allow us to skip expensive sensors, enable new fabrication techniques such as 3D printing, and are robust to imprecise manufacturing! 11/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

There are still ways to go - but we think that this direction is super promising and just the beginning of an exciting arc of building representations of robotic embodiments that can bridge vision, proprioception, touch, and more! 12/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Again, shoutout to the amazing team at MIT - make sure to check them out! @sizhe_lester_li, @annan__zhang, @BoyuanChen0, Hanna Matusik, Chao Liu, and Daniela Rus! Code & everything is already released: 13/n

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Also, I should add we are aware of another paper that is named "Neural Jacobian Fields": It's a completely different idea and problem and not directly related, but still really, really cool (check it out!)...

Vincent Sitzmann's profile picture
Vincent Sitzmann1 year ago

Our paper is actually called "Unifying 3D Representation and Control of Diverse Robots with a Single Camera". We thought about not naming the core method "Neural Jacobian Fields", but it makes the most sense, b/c we are really modeling the "System Jacobian", a staple in robotics.

Related Videos