正在加载视频...
视频加载失败
Reinforcement learning should be able to improve upon behaviors seen when training. In practice, RL agents often struggle to generalize to new long-horizon behaviors. Our new paper studies *horizon generalization*, the degree RL algorithms generalize to reaching distant goals. 1/
79,514 次观看 • 1 年前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里
