正在加载视频...

视频加载失败

How to use simulation data for real-world robot manipulation? We present sim-and-real co-training, a simple recipe for manipulation. We demonstrate that sim data can significantly enhance real-world performance, even with notable differences between the sim and the real. (1/n)

44,173 次观看 • 1 年前 •via X (Twitter)

11 条评论

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Paper: Website: We consider two types of simulation datasets: Task-Aware Digital Cousins and Task-Agnostic Prior Simulation Data. Task-Aware Digital Cousins: First introduced by Dai et al., digital cousins are virtual assets that, unlike a digital twin, do not explicitly model a real-world counterpart but still exhibit similar geometric and semantic affordances. In this work, we use "task-aware digital cousins" to refer to simulation tasks that share the same task semantics, namely the object categories in the environment and the same behaviors. (2/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Task-Agnostic Prior Simulation Data: We also consider existing large-scale simulation datasets, which require no additional efforts on designing new tasks or collecting new data and have significantly more diversity but less alignment. (3/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Through comprehensive experiments, we present a simple recipe for effectively utilizing simulation data in real-world manipulation tasks: 1. Task and scene composition. Use task-aware digital cousins with similar task and scene compositions to real-world tasks. Multi-task prior simulation data can still help even with different compositions. 2. Object composition and initialization. Incorporate diverse objects and varying placements in simulation to improve generalization. 3. Task-aware digital cousin alignment. Ensure simulation tasks share the same definition and success criteria as real-world tasks. Similar camera viewpoints help, but perfect alignment isn't necessary. 4. Co-training hyperparameters. Use significantly more simulation data than real-world data and carefully tune the co-training ratio. (4/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Our sim-and-real co-training pipeline is as follows. (5/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

We verified that sim-and-real co-training is compatible with large-scale imitation learning. Co-training with simulation data boosts the real-world performance in data-rich settings. (6/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Our recipe presented above comes from a comprehensive study across 11 different tasks and 2 embodiments to understand which dataset composition factors in simulation and real-world datasets matter the most. (7/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

Our strategy allows agents to generalize to novel object entities and poses unseen in the real-world dataset. (8/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

We find that one of the most important hyperparameters for effective co-training is the co-training ratio between sim and real data. In our experiments, a co-training ratio of 99% yielded the best performance. (9/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

We also find camera alignment to be critical for successful co-training with task-aware digital cousin data. Training policies on severely misaligned simulation data results in a significant drop in performance compared to policies co-trained with properly aligned digital cousin data. On the Panda arm CounterToSinkPnP task, the cotraining success rate dropped from 67% to 56%, while in the GR-1 humanoid CupPnP task, it declined from 95% to 70%. However, the aligned camera does not need to be strictly identical to the real-world camera. (10/n)

Zhenyu Jiang 的头像
Zhenyu Jiang1 年前

This work is done at NVIDIA’s GEAR lab and UT Austin with amazing collaborators @abhirammaddukur, @Lawrence_Y_Chen, @snasiriany, @yuqi_xie5, Yu Fang, Wenqi Huang, @zuwang95, @Zhenjia_Xu, @nc__dev, @scott_e_reed, @Ken_Goldberg, @AjayMandlekar, @DrJimFan, and @yukez.

Lucid Scientific, Inc. 的头像
Lucid Scientific, Inc.1 年前

Expand the possibilities of your metabolic research. Resipher tracks real-time cellular oxygen consumption in standard 96-well plates, delivering continuous real-time data directly from your incubator. Request a free virtual demo or quote today >>

相关视频

Today, we're joined by Nikita Rudin, co-founder and CEO of Flexion Robotics to discuss the gap between current robotic capabilities and what’s required to deploy fully autonomous robots in the real world. Nikita explains how reinforcement learning and simulation have driven rapid progress in robot locomotion—and why locomotion is still far from “solved.” We dig into the sim2real gap, and how adding visual inputs introduces noise and significantly complicates sim-to-real transfer. We also explore the debate between end-to-end models and modular approaches, and why separating locomotion, planning, and semantics remains a pragmatic approach today. Nikita also introduces the concept of "real-to-sim", which uses real-world data to refine simulation parameters for higher fidelity training, discusses how reinforcement learning, imitation learning, and teleoperation data are combined to train robust policies for both quadruped and humanoid robots, and introduces Flexion's hierarchical approach that utilizes pre-trained Vision-Language Models (VLMs) for high-level task orchestration with Vision-Language-Action (VLA) models and low-level whole-body trackers. Finally, Nikita shares the behind-the-scenes in humanoid robot demos, his take on reinforcement learning in simulation versus the real world, the nuances of reward tuning, and offers practical advice for researchers and practitioners looking to get started in robotics today. 🗒️ For the full list of resources for this episode, visit the show notes page: 📖 CHAPTERS =============================== 00:00 - Introduction 04:07 - Is robot locomotion solved? 06:04 - Sim-to-real gap 08:58 - Adding semantics to policies 09:42 - Modular vs end-to-end architectures 10:29 - Planner model 12:21 - Adapting RL techniques from quadrupeds to humanoids 15:39 - Behind robot demos 18:09 - Humanoid robots in home environments 22:03 - Training approach 23:56 - VLA models 27:59 - Closing the sim-to-real gap 32:55 - Task orchestration using VLMs 36:38 - Tool use 38:10 - Model hierarchy 43:37 - Simulator versus simulation environment 44:57 - Combining imitation learning and reinforcement learning 46:42 - RL in real world versus RL in simulation 52:58 - Reward tuning and value functions in robotics 56:38 - Predictions 1:00:10 - Humanoids, quadropeds, and wheeled platforms 1:02:45 - Advice, recommended robot kits, and community pla

The TWIML AI Podcast

22,264 次观看 • 5 个月前