Async RL decouples rollouts from training, and that’s why Echo-2 is so efficient. Distributed actors on Echo-2 collect rollouts on their own schedule while the learner updates continuously. Less waiting. Higher throughput. Here’s an illustration👇
33,694 views • 4 months ago • via X (Twitter)
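
For readers who want the decoupling in code: a minimal Python sketch of the actor/learner split the post describes. This is not Echo-2's implementation. The names `actor`, `run`, and `get_policy_version` are hypothetical, the environment is simulated with `time.sleep`, and the "update" is just a version counter standing in for a gradient step.

```python
import queue
import threading
import time


def actor(actor_id, rollouts, stop, get_policy_version):
    """Collect rollouts on this actor's own schedule (hypothetical helper)."""
    step = 0
    while not stop.is_set():
        # Simulated environment time; each actor runs at a different speed.
        time.sleep(0.001 * (actor_id + 1))
        rollout = {
            "actor": actor_id,
            "step": step,
            # Record which policy version produced this rollout.
            "policy_version": get_policy_version(),
        }
        try:
            rollouts.put(rollout, timeout=0.05)
        except queue.Full:
            continue  # buffer is full; re-check the stop flag and retry
        step += 1


def run(num_actors=3, num_updates=20):
    """Run actors and a learner concurrently; return final version and staleness."""
    rollouts = queue.Queue(maxsize=64)  # shared buffer between actors and learner
    stop = threading.Event()
    version = [0]
    lock = threading.Lock()

    def get_policy_version():
        with lock:
            return version[0]

    threads = [
        threading.Thread(target=actor, args=(i, rollouts, stop, get_policy_version))
        for i in range(num_actors)
    ]
    for t in threads:
        t.start()

    staleness = []
    for _ in range(num_updates):
        # The learner takes whatever rollout is ready; it never waits for
        # all actors to finish a synchronized batch.
        rollout = rollouts.get()
        with lock:
            staleness.append(version[0] - rollout["policy_version"])
            version[0] += 1  # stand-in for one gradient update

    stop.set()
    for t in threads:
        t.join()
    return version[0], staleness


if __name__ == "__main__":
    final_version, staleness = run()
    print("updates:", final_version, "max staleness:", max(staleness))
```

The `staleness` list records how many updates the learner had applied since the policy that generated each rollout. That lag is the usual price of the asynchronous design: rollouts may come from a slightly older policy than the one being trained.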

