
Fangchen Liu
@fangchenliu_ • 1,402 subscribers
research @GoogleDeepMind, phd @berkeley_ai
Shorts
Videos

1/N Most Vision-Language-Action models need tons of data for finetuning, and still fail for new objects and instructions. Introducing OTTER, a lightweight, easy-to-train model that uses text-aware visual features to nail unseen tasks out of the box! Here's how it works 👇
Fangchen Liu68,288 Aufrufe • vor 1 Jahr
Keine weiteren Inhalte verfügbar