Research @GoogleDeepMind, Prev: PhD @StanfordAILab, Stanford BS/MS
Shorts
Not all human-collected demos are created equal: ✔️ All are successful ❌ But some strategies are unreliable or brittle This can hurt final performance. Demo-SCORE self-curates reliable training data using online experience. Paper and videos: