Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

In my past research experience, finding or developing an appropriate simulation environment, dataset, and benchmark has always been a challenge. Missing features, limited support, or unexpected bugs often occupied my days and nights. Moreover, current simulation platforms are relatively fragmented—making it challenging to replicate the success of the RT-X...

84,166 görüntüleme • 1 yıl önce •via X (Twitter)

12 Yorum

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Beyond what we have showcased in the video above, RoboVerse has even more exciting features waiting to be explored: Teleoperation: We support multiple teleoperation methods in RoboVerse. We designed a mobile app that utilizes phone sensors to enable seamless teleoperation within the RoboVerse platform. Also, we partially support more devices like Mocap, VR, Keyboard, and Joysticks.

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Real2Sim: We support Real-to-Sim toolset to reconstruct real-world assets from monocular video, utilizing 3D Gaussian Splatting techniques.

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

AI-Generated Tasks: Based on the unified configuration of MetaSim for tasks, we leverage LLM to combine the assets from RoboVerse data and generate new tasks, showing the potential of LLM for creative task generation.

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Seamlessly Enable GPU Parallelism: RoboVerse makes it much easier to transfer tasks and benchmarks that previously didn’t support large-scale parallelism—enabling them to run large-scale parallel reinforcement learning efficiently on GPUs.

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Huge thanks to the entire team for the incredible efforts on this ambitious project🥺!!! We sincerely hope every contribution pays off. We also encourage the wider community to get involved — together, let's drive real progress in robotics🚀!

Lucid Scientific, Inc. profil fotoğrafı
Lucid Scientific, Inc.1 yıl önce

Expand the possibilities of your metabolic research. Resipher tracks real-time cellular oxygen consumption in standard 96-well plates, delivering continuous real-time data directly from your incubator. Request a free virtual demo or quote today >>

Lwin Moe Aung profil fotoğrafı
Lwin Moe Aung1 yıl önce

Does it support Drake?

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Not yet. We are working hard to include more! Also, pull request is highly appreciated :)

arun kumar singh profil fotoğrafı
arun kumar singh1 yıl önce

Does it support Mujoco or Mujoco JAX?

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Yes! We do support MuJoCo! Check this and have a try!

David Blanco-Mulero profil fotoğrafı
David Blanco-Mulero1 yıl önce

Really amazing work! I see the deformables sim is mainly based on garmentlab. Have you thought about integrating the different deformable sims objects e.g. Mujoco does well on volumetric objects, maniskill, etc. rather than only Isaac-based?

Haoran Geng profil fotoğrafı
Haoran Geng1 yıl önce

Thank you for the suggestion! We are working hard to include more. Deformable engines from other simulators are indeed very useful, as they offer several advantages. Stay tuned!

Benzer Videolar

Everything you love about generative models — now powered by real physics! Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds powered by a physics simulation platform designed for general-purpose robotics and physical AI applications. Genesis's physics engine is developed in pure Python, while being 10-80x faster than existing GPU-accelerated stacks like Isaac Gym and MJX. It delivers a simulation speed ~430,000 faster than in real-time, and takes only 26 seconds to train a robotic locomotion policy transferrable to the real world on a single RTX4090 (see tutorial: The Genesis physics engine and simulation platform is fully open source at We'll gradually roll out access to our generative framework in the near future. Genesis implements a unified simulation framework all from scratch, integrating a wide spectrum of state-of-the-art physics solvers, allowing simulation of the whole physical world in a virtual realm with the highest realism. We aim to build a universal data engine that leverages an upper-level generative framework to autonomously create physical worlds, together with various modes of data, including environments, camera motions, robotic task proposals, reward functions, robot policies, character motions, fully interactive 3D scenes, open-world articulated assets, and more, aiming towards fully automated data generation for robotics, physical AI and other applications. Open Source Code: Project webpage: Documentation: 1/n

Zhou Xian

3,814,443 görüntüleme • 1 yıl önce

Synthetic data will provide the next trillion tokens to fuel our hungry models. I'm excited to announce MimicGen: massively scaling up data pipeline for robot learning! We multiply high-quality human data in simulation with digital twins. Using 50,000 training episodes across 18 tasks, multiple simulators, and even in the real-world! The idea is simple: 1. Humans tele-operate the robot to complete a task. It is extremely high-quality but also very slow and expensive. 2. We create a digital twin of the robot and the scene in high-fidelity, GPU-accelerated simulation. 3. We can now move objects around, replace with new assets, and even change the robot hand - basically augment the training data with procedural generation. 4. Export the successful episodes, and feed that to a neural network! You now have an near-infinite stream of data. One of the key reasons that robotics lags far behind other AI fields is the lack of data: you cannot scrape control signals from the internet. They simply don't exist in-the-wild. MimicGen shows the power of synthetic data and simulation to keep our scaling laws alive. I believe this principle apply beyond robotics. We are quickly exhausting the high-quality, real tokens from the web. Artificial intelligence from artificial data will be the way forward. We are big fans of the OSS community. As usual, we open-source everything, including the generated dataset! - Website: - Paper: - Dataset is hosted on HuggingFace (thanks AK!!): - Code: MimicGen is led by Ajay Mandlekar, deep dive in the thread:

Jim Fan

332,164 görüntüleme • 2 yıl önce

I don’t know if we live in a Matrix, but I know for sure that robots will spend most of their lives in simulation. Let machines train machines. I’m excited to introduce DexMimicGen, a massive-scale synthetic data generator that enables a humanoid robot to learn complex skills from only a handful of human demonstrations. Yes, as few as 5! DexMimicGen addresses the biggest pain point in robotics: where do we get data? Unlike with LLMs, where vast amounts of texts are readily available, you cannot simply download motor control signals from the internet. So researchers teleoperate the robots to collect motion data via XR headsets. They have to repeat the same skill over and over and over again, because neural nets are data hungry. This is a very slow and uncomfortable process. At NVIDIA, we believe the majority of high-quality tokens for robot foundation models will come from simulation. What DexMimicGen does is to trade GPU compute time for human time. It takes one motion trajectory from human, and multiplies into 1000s of new trajectories. A robot brain trained on this augmented dataset will generalize far better in the real world. Think of DexMimicGen as a learning signal amplifier. It maps a small dataset to a large (de facto infinite) dataset, using physics simulation in the loop. In this way, we free humans from babysitting the bots all day. The future of robot data is generative. The future of the entire robot learning pipeline will also be generative. 🧵

Jim Fan

165,215 görüntüleme • 1 yıl önce

Today, we're joined by Sergey Levine, associate professor at UC Berkeley EECS and co-founder of Physical Intelligence to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the roles of pre-training and post-training with a diverse mixture of real-world data to ensure robust and intelligent robot learning. We review the data collection approach, which uses human operators and teleoperation rigs, the potential of synthetic data and reinforcement learning in enhancing robotic capabilities, and much more. We also introduce the team’s new FAST tokenizer, which opens the door to a fully Transformer-based model and significant improvements in learning and generalization. Finally, we cover the open-sourcing of π0 and future directions for their research. 🎧 / 🎥 Listen or watch the full episode on our page: 📖 CHAPTERS =============================== 00:00 - Introduction 2:14 - Physical Intelligence 3:47 - Key challenges in robotic learning 6:13 - Reinforcement learning in π0 and robotic foundation models 8:36 - π0 VLM model architecture 15:33 - π0 model recipe 18:39 - Pre-training dataset 22:47 - Post-training 24:23 - Laundry folding demo 31:32 - Scaling laws on π0 model 34:57 - FAST 40:26 - Open sourcing π0 43:37 - Other robot types 46:27 - Future directions

The TWIML AI Podcast

19,942 görüntüleme • 1 yıl önce