Загрузка видео...

Не удалось загрузить видео

На главную

How can robots learn generalizable manipulation skills for diverse objects? Going beyond pick-and-place, our recent work “HACMan” enables complex interactions for unseen objects, such as flipping, pushing, or tilting, using spatial action maps + RL with point clouds. (w/ @MetaAI)

49,846 просмотров • 3 лет назад •via X (Twitter)

Комментарии: 10

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

We find that defining the right action space is crucial for learning a manipulation task. We explore an object-centric action representation in RL that consists of selecting a contact location on the object and a set of parameters describing the robot's movement after contact.

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

Our object-centric action representation has two benefits. It is… 1. Spatially-grounded: because the learned contact location is selected from the observed object points. 2. Temporally-abstracted: because we focus only on learning the contact-rich portions of the action.

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

With off-policy RL, given a point cloud, the actor outputs per-point motion parameters (Actor Map) while the critic outputs per-point Q-values (Critic Map). The Critic Map is not only used to update the actor but also serves as the scores for selecting the contact location.

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

We evaluate our method with a 6D object pose alignment task with randomized initial poses, randomized 6D goals, and diverse unseen objects in both simulation and in the real world.

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

HACMan outperforms the baselines, with a larger margin for more challenging tasks. Success rates for simple tasks - pushing a single object to an in-plane goal - are high for all methods, but only HACMan achieves high success rates for 6D alignment of diverse objects.

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

Check out the paper and the website for more information and video results showing HACMan generalizing to different objects and goals! w/@bwww08, Fan Yang, @chris_j_paxton, @davheld

Фото профиля Brett Adcock
Brett Adcock3 лет назад

@MetaAI Congrats, thanks for sharing.

Фото профиля Arnav Wadhwa
Arnav Wadhwa3 лет назад

@MetaAI Amazing work! I’m wondering about the challenges/improvements tradeoff when using a human-hand like end effector with 5 fingers. Curious to know what you think

Фото профиля Wenxuan Zhou
Wenxuan Zhou3 лет назад

@MetaAI Multi-fingered hands may allow a wider variety of motions and have more tolerance (picking an object with a multi-fingered hand can be less sensitive to object shapes than a simple gripper). However, they are more expensive, easier to break, and have a bigger sim2real gap.

Фото профиля Sasha Salter
Sasha Salter2 лет назад

@MetaAI Great use of temporal abstraction to simplify learning!

Похожие видео