Aviral Kumar's banner
Aviral Kumar's profile picture

Aviral Kumar

@aviral_kumar25,816 subscribers

Assistant Professor of CS & ML at @CarnegieMellon. PhD from UC Berkeley.

Shorts

🚨Current scalable RL algos train a policy w/o value func, which is limiting with learning in open-ended, non-stationary, dynamic environments. But, how to scale value-based RL with more data/compute is unclear... Not anymore: presenting scaling laws for value-based RL 🧵⬇️

🚨Current scalable RL algos train a policy w/o value func, which is limiting with learning in open-ended, non-stationary, dynamic environments. But, how to scale value-based RL with more data/compute is unclear... Not anymore: presenting scaling laws for value-based RL 🧵⬇️

37,266 просмотров

Videos

Больше нет контента для загрузки