正在加载视频...
视频加载失败
Some more thoughts about Yann interview: Even if LLMs work great, that's missing the point. Everyone's doing the same thing now. More scale, more data, longer CoT, tweak RL. But the path to get there was completely stochastic. Attention, transformers, scaling laws, RLHF, none of it was obvious, it... show more
0 条评论
暂无评论
原始帖子的评论将显示在这里
