Aviral Kumar's banner

Aviral Kumar

@aviral_kumar2 • 6,022 subscribers

Assistant Professor of CS & ML at @CarnegieMellon. PhD from UC Berkeley.

Shorts

🚨Current scalable RL algos train a policy w/o value func, which is limiting with learning in open-ended, non-stationary, dynamic environments. But, how to scale value-based RL with more data/compute is unclear... Not anymore: presenting scaling laws for value-based RL 🧵⬇️

🚨Current scalable RL algos train a policy w/o value func, which is limiting with learning in open-ended, non-stationary, dynamic environments. But, how to scale value-based RL with more data/compute is unclear... Not anymore: presenting scaling laws for value-based RL 🧵⬇️

37,301 просмотров

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

🚨🚨New paper: if you want robots to do bimanual long-horizon tasks well, try RaC: a human in-the-loop data collection protocol that naturally amplifies Recovery & Correction behaviors + trains on them. 📈 data efficiency 10x vs prior results (+many nice properties). 🧵⬇️

🚨🚨New paper: if you want robots to do bimanual long-horizon tasks well, try RaC: a human in-the-loop data collection protocol that naturally amplifies Recovery & Correction behaviors + trains on them. 📈 data efficiency 10x vs prior results (+many nice properties). 🧵⬇️

25,884 просмотров • 10 месяцев назад

Больше нет контента для загрузки