Very happy to share our new work APRL (+ open-source code release)! The important step forward we took here is enabling the robot to keep improving with more data—walking faster and adapting to new situations—where prior work saturates.

48,024 views • 2 years ago • via X (Twitter)

8 Comments

Laura Smith • 2 years ago

Navigating the trade-off between efficiency and performance is tough but important for real-world learning. APRL modulates the robot's exploration based on a notion of 'familiarity', i.e., the robot can explore more aggressively when it can predict the dynamics, and more conservatively when it cannot.

Laura Smith • 2 years ago

Structuring exploration in this way allows us not only to train remarkably quickly in the real world, like our prior "Walk in the Park" system, but also to reach much higher performance with further training, especially in more complex situations like soft terrain, inclines, and outdoor settings.

Laura Smith • 2 years ago

The code has all you need to get started (other than the robot itself): reset policy, simulated and real environments, and RL training. If you have a Go1, give it a try! It should take only a few minutes to set up ◡̈

Laura Smith • 2 years ago

Thanks to Yunhao Cao and my advisor @svlevine! See Sergey's thread here: + the previous "Walk in the Park" work we built on here:

Oliver Groth • 2 years ago

Awesome work! Also, great to see more open source releases for robotics papers. Keep up the great work! 🙂

sorina • 2 years ago

Really great work! I am wondering how this method can be further extended to ensure the robot has a natural walking gait, rather than a crawling-like walking gait.

Karol Hausman • 2 years ago

Very cool, congrats @smithlaura1028!

PRIV • 2 years ago

Wow.
