
Sham Kakade
@ShamKakade6 • 18,795 subscribers
Harvard Professor. Full stack ML and AI. Co-director of the Kempner Institute for the Study of Artificial and Natural Intelligence.
Videos

1/ Au revoir, RLVR. New work: EBFT (Energy-Based Fine-Tuning), a post-training method that directly optimizes the long-horizon behavior of model generations, addressing SFT’s deployment-time error amplification without relying on sparse, task-specific rewards.
Sham Kakade • 266,585 views • 3 months ago