正在加载视频...
视频加载失败
Karpathy told Dwarkesh that a 1 billion parameter model, trained on clean data, could hit the intelligence of today's 1.8 trillion parameter frontier. That is a 1,800x compression claim. The math behind it is more defensible than it sounds. When researchers at frontier labs look at random samples from... show more
507,825 次观看 • 2 个月前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里
