正在加载视频...

视频加载失败

Introducing Masked Trajectory Modeling (MTM), a new general-purpose framework for sequential decision making. A single transformer trained with MTM can exhibit multiple capabilities by simply choosing different masking patterns at inference time. Accepted at ICML 2023. 🧵👇

93,089 次观看 • 3 年前 •via X (Twitter)

11 条评论

Philipp Wu 的头像
Philipp Wu3 年前

MTM trains by reconstructing a masked input sequence using random autoregressive masking (think BERT with a prediction prior). We prompt the model with different masking patterns to achieve specific capabilities such as behavior cloning and inverse dynamics.

Philipp Wu 的头像
Philipp Wu3 年前

We also evaluate MTM in offline RL settings, such as D4RL. At test time we prompt the model with a desired return. We find that MTM is comparable to prior work while also also having the capability to be used for other tasks.

Philipp Wu 的头像
Philipp Wu3 年前

MTM is able to train on datasets with missing data modes. Ex: we might have a large dataset of state trajectories, but only a small amount with action labels. MTM is able to train on both without any change to the algorithm, simply by treating missing data modes as a forced mask.

Philipp Wu 的头像
Philipp Wu3 年前

We test if MTM is able to leverage the trajectories without actions. Here we compare various methods that only train on the labeled subset and our Heteromodal MTM which also trains on trajectories without action labels. Heteromodal MTM is able to improve task performance.

Philipp Wu 的头像
Philipp Wu3 年前

MTM can also be used to learn representations. After pretraining on generic trajectories, we use TD3 on top of the learned MTM representations. We find that these representations help with faster learning and in some cases, improve on asymptotic performance.

Philipp Wu 的头像
Philipp Wu3 年前

If you are interested in this line of work, also checkout MaskDP ( from my lab mate @fangchenliu. Our work extends on some of the ideas presented here and explores a different set of capabilities and features of this paradigm.

Philipp Wu 的头像
Philipp Wu3 年前

If you want to get a quick start with MTM, checkout this notebook which illustrates how to train and use MTM on a simple sinusoidal dataset.

Philipp Wu 的头像
Philipp Wu3 年前

Thanks for the help and assistance of amazing collaborators! @arjunmajum, @kevinleestone, @yixin_lin_, @IMordatch, @pabbeel, @aravindr93 If you are interested in more details checkout the links below! Website: Code:

Philipp Wu 的头像
Philipp Wu3 年前

Paper now out on arxiv!

kourosh hakhamaneshi 的头像
kourosh hakhamaneshi3 年前

Nice. It finally worked. Congrats. 🎉

Philipp Wu 的头像
Philipp Wu3 年前

😂 thanks kourosh!!

相关视频