Loading video...
Video Failed to Load
Karpathy told Dwarkesh that a 1 billion parameter model, trained on clean data, could hit the intelligence of today's 1.8 trillion parameter frontier. That is a 1,800x compression claim. The math behind it is more defensible than it sounds. When researchers at frontier labs look at random samples from... show more
507,774 views • 2 months ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
