Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Introducing Masked Trajectory Modeling (MTM), a new general-purpose framework for sequential decision making. A single transformer trained with MTM can exhibit multiple capabilities by simply choosing different masking patterns at inference time. Accepted at ICML 2023. 🧵👇

93,089 Aufrufe • vor 3 Jahren •via X (Twitter)

11 Kommentare

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

MTM trains by reconstructing a masked input sequence using random autoregressive masking (think BERT with a prediction prior). We prompt the model with different masking patterns to achieve specific capabilities such as behavior cloning and inverse dynamics.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

We also evaluate MTM in offline RL settings, such as D4RL. At test time we prompt the model with a desired return. We find that MTM is comparable to prior work while also also having the capability to be used for other tasks.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

MTM is able to train on datasets with missing data modes. Ex: we might have a large dataset of state trajectories, but only a small amount with action labels. MTM is able to train on both without any change to the algorithm, simply by treating missing data modes as a forced mask.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

We test if MTM is able to leverage the trajectories without actions. Here we compare various methods that only train on the labeled subset and our Heteromodal MTM which also trains on trajectories without action labels. Heteromodal MTM is able to improve task performance.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

MTM can also be used to learn representations. After pretraining on generic trajectories, we use TD3 on top of the learned MTM representations. We find that these representations help with faster learning and in some cases, improve on asymptotic performance.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

If you are interested in this line of work, also checkout MaskDP ( from my lab mate @fangchenliu. Our work extends on some of the ideas presented here and explores a different set of capabilities and features of this paradigm.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

If you want to get a quick start with MTM, checkout this notebook which illustrates how to train and use MTM on a simple sinusoidal dataset.

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

Thanks for the help and assistance of amazing collaborators! @arjunmajum, @kevinleestone, @yixin_lin_, @IMordatch, @pabbeel, @aravindr93 If you are interested in more details checkout the links below! Website: Code:

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

Paper now out on arxiv!

Profilbild von kourosh hakhamaneshi
kourosh hakhamaneshivor 3 Jahren

Nice. It finally worked. Congrats. 🎉

Profilbild von Philipp Wu
Philipp Wuvor 3 Jahren

😂 thanks kourosh!!

Ähnliche Videos