
Fangchen Liu
@fangchenliu_ • 1,402 subscribers
research @GoogleDeepMind, phd @berkeley_ai
Shorts
Videos

1/N Most Vision-Language-Action models need tons of data for finetuning, and still fail for new objects and instructions. Introducing OTTER, a lightweight, easy-to-train model that uses text-aware visual features to nail unseen tasks out of the box! Here's how it works 👇
Fangchen Liu68,288 次观看 • 1 年前
没有更多内容可加载