This is a single uncut video, showing a robot learning several tasks instantly, after just one demonstration each ... This is possible because we've now been able to achieve in-context learning for everyday robotics tasks, and I'm very excited to announce our latest paper: 🎆 Instant Policy: In-Context Imitation...

74,663 views • 1 year ago • via X (Twitter)

10 Comments

Edward Johns • 1 year ago

In-context learning is where a trained model accepts examples of a new task (the "context") at its input, and can then make predictions for that same task given a novel instance of it, without any further training or weight updates. Achieving this in robotics is very exciting: with Instant Policy, we can now provide one or a few demonstrations (the "context"), and the robot instantly learns a closed-loop policy for that task, which it can then immediately perform. (2/6)
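
To make the idea of conditioning on a context concrete, here is a minimal Python sketch. It is not the Instant Policy network: `in_context_policy` is a hypothetical toy that simply looks up the nearest demonstrated observation. What it does share with the description above is the interface, where the demonstration is passed in as an input at inference time and nothing is trained or updated.

```python
import numpy as np

def in_context_policy(context, observation):
    """Toy stand-in for a trained in-context model (an assumption, not the
    paper's network): return the action paired with the demonstration
    observation that is closest to the current observation."""
    demo_obs = np.stack([obs for obs, _ in context])
    demo_actions = np.stack([act for _, act in context])
    distances = np.linalg.norm(demo_obs - observation, axis=1)
    return demo_actions[np.argmin(distances)]

# One demonstration, given as (observation, action) pairs: this is the "context".
demo = [
    (np.array([0.0, 0.0]), np.array([1.0, 0.0])),
    (np.array([1.0, 0.0]), np.array([0.0, 1.0])),
]

# A novel observation from the same task; no weights are updated anywhere.
action = in_context_policy(demo, np.array([0.9, 0.1]))
```

Swapping in a different `demo` changes the behaviour immediately, which is the property the thread refers to as learning "instantly".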

Edward Johns • 1 year ago

The figure below shows our network architecture, which jointly expresses the context (demonstrations, as sequences of observations and actions), the current observation, and the future actions. Observations are point clouds, and actions are relative gripper poses. During inference, actions are predicted using a learned diffusion process on the graph nodes representing the actions, conditioned on the demonstrations and the current observation. (3/6)
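
The inference step described above can be loosely illustrated with an iterative refinement loop. This is only a sketch under strong assumptions: `toy_denoiser` is a hypothetical placeholder for the learned graph network, actions are reduced to 3D relative translations rather than full gripper poses, and the update rule is a simple blend rather than the diffusion schedule used in the paper.

```python
import numpy as np

def toy_denoiser(noisy_actions, demo_actions, observation):
    """Hypothetical placeholder for the learned network: it 'predicts' the
    clean actions by copying the demonstrated ones. The noisy actions and the
    observation are accepted only to mirror the conditioning interface."""
    return demo_actions

def sample_actions(demo_actions, observation, num_steps=10, seed=0):
    rng = np.random.default_rng(seed)
    # Start from pure noise with the same shape as the action sequence.
    actions = rng.normal(size=demo_actions.shape)
    for step in range(num_steps):
        predicted_clean = toy_denoiser(actions, demo_actions, observation)
        # Move part of the way towards the prediction; the blend factor
        # reaches 1.0 on the final step, so the loop ends on the prediction.
        blend = 1.0 / (num_steps - step)
        actions = actions + blend * (predicted_clean - actions)
    return actions

# Assumed toy data: four relative translations and a small point cloud.
demo_actions = np.array([[0.1, 0.0, 0.0]] * 4)
observation = np.zeros((16, 3))
predicted = sample_actions(demo_actions, observation)
```

The point of the sketch is the control flow: actions start as noise and are refined over several steps, with the demonstrations and the current observation supplied as conditioning at every step.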

Edward Johns • 1 year ago

One very exciting aspect of Instant Policy is that we don't need any real-world training data. The entire network can be trained with simulated "pseudo-demonstrations", which are arbitrary trajectories with random objects, all in simulation. And we found very promising scaling laws: we can continue to generate these pseudo-demonstrations in simulation, and the performance of the network continues to improve. (4/6)
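
As a rough illustration of what an "arbitrary trajectory with random objects" could look like in code, the sketch below samples a random point cloud and a random gripper path. Everything here is an assumption made for illustration: the function name `make_pseudo_demo`, the workspace bounds, and the straight-line interpolation between waypoints are not taken from the paper, which should be consulted for the actual generation procedure.

```python
import numpy as np

def make_pseudo_demo(num_points=64, num_waypoints=3, steps_per_segment=5, seed=0):
    """Hypothetical pseudo-demonstration: a random object point cloud plus a
    gripper path through random waypoints (positions only, no rotations)."""
    rng = np.random.default_rng(seed)
    # Random "object": a small cloud of points around a random centre.
    centre = rng.uniform(-0.5, 0.5, size=3)
    object_points = centre + rng.uniform(-0.1, 0.1, size=(num_points, 3))
    # Random waypoints, joined by straight-line segments.
    waypoints = rng.uniform(-0.5, 0.5, size=(num_waypoints, 3))
    segments = []
    for start, end in zip(waypoints[:-1], waypoints[1:]):
        t = np.linspace(0.0, 1.0, steps_per_segment, endpoint=False)[:, None]
        segments.append(start + t * (end - start))
    positions = np.vstack(segments + [waypoints[-1:]])
    # Actions are relative translations between consecutive gripper positions.
    actions = np.diff(positions, axis=0)
    return object_points, positions, actions

object_points, positions, actions = make_pseudo_demo()
```

Because each call with a new `seed` yields a new trajectory, a generator like this can keep producing training data indefinitely, which is the property the scaling observation above relies on.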

Edward Johns • 1 year ago

Beyond just regular imitation learning, we also discovered two intriguing downstream applications: (1) Cross-embodiment transfer from human-hand demonstrations to robot policies. (2) Zero-shot transfer to language-defined tasks without needing large language-annotated datasets. (5/6)

Edward Johns • 1 year ago

This was led by my excellent student Vitalis Vosylius (@vitalisvos19), in the final project of his PhD. To read the paper and see more videos, please visit And we have code and weights available on the webpage, for you to teach your own robot with Instant Policy. Please try it out, and let us know how you get on! Thanks for reading 😀 (6/6)

You Jiacheng • 1 year ago

Great work! I have a small problem: how did you prompt SAM in this video? there is another person?

tOSUFever • 1 year ago

this is cool 😎

Ornias • 1 year ago

Feels like I'm watching an animal rather than a robot.

XXXin • 1 year ago

Seeing more and more works like this. Wondering how we can leverage the power of community to collect data efficiently in mass, and how the system generalizes under different configurations

Appy Pie • 1 year ago

Exciting breakthrough in robotics! With in-context learning, robots can now master tasks instantly after just one demonstration. This is a huge step forward in making robots more adaptable and efficient!
