Загрузка видео...

Не удалось загрузить видео

На главную

Traditional tokenization methods for robotic actions struggle with high-frequency, dexterous tasks due to redundancy and inefficiency. Inspired by JPEG compression, Physical Intelligence has developed a compressed action representation that accelerates VLA model training 5x.

26,198 просмотров • 1 год назад •via X (Twitter)

Комментарии: 7

Фото профиля The Humanoid Hub
The Humanoid Hub1 год назад

More details in this thread:

Фото профиля UserInterface
UserInterface2 лет назад

Unveiling the Future of Prompt Engineering for Better AI Interactions #tech

Фото профиля navuud
navuud1 год назад

How does one build a laundry robot like this at home 😁

Фото профиля Brian Bellia
Brian Bellia1 год назад

So, does this mean the end of teleoperation as a means of training humanoids? I hope it spells the end of teleoperation, period. At this stage, it should be autonomous or nothing - except in rare cases like Optimus catching a ball.

Фото профиля The Humanoid Hub
The Humanoid Hub1 год назад

The DROID dataset they using for training is generated with teleop

Фото профиля Disarm.AGI.UBI
Disarm.AGI.UBI1 год назад

Give'm some practice. They will get better.

Фото профиля Rethynk AI
Rethynk AI1 год назад

Really like the way it put the 2nd on the top of the first. After intellectual capital, machines are getting better understanding of physical environment.

Похожие видео

Today, we're joined by Sergey Levine, associate professor at UC Berkeley EECS and co-founder of Physical Intelligence to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the roles of pre-training and post-training with a diverse mixture of real-world data to ensure robust and intelligent robot learning. We review the data collection approach, which uses human operators and teleoperation rigs, the potential of synthetic data and reinforcement learning in enhancing robotic capabilities, and much more. We also introduce the team’s new FAST tokenizer, which opens the door to a fully Transformer-based model and significant improvements in learning and generalization. Finally, we cover the open-sourcing of π0 and future directions for their research. 🎧 / 🎥 Listen or watch the full episode on our page: 📖 CHAPTERS =============================== 00:00 - Introduction 2:14 - Physical Intelligence 3:47 - Key challenges in robotic learning 6:13 - Reinforcement learning in π0 and robotic foundation models 8:36 - π0 VLM model architecture 15:33 - π0 model recipe 18:39 - Pre-training dataset 22:47 - Post-training 24:23 - Laundry folding demo 31:32 - Scaling laws on π0 model 34:57 - FAST 40:26 - Open sourcing π0 43:37 - Other robot types 46:27 - Future directions

The TWIML AI Podcast

19,942 просмотров • 1 год назад