Loading video...
Video Failed to Load
THIS DEVELOPER JUST RAN A TRILLION PARAMETER MODEL ON 4 MAC STUDIOS - 10X FASTER AND 5X CHEAPER THAN CLOUD CODE 19:00 he says it out loud. "we just ran a trillion parameter model. 30 something tokens per second. wow." RDMA over Thunderbolt made the cluster 10x faster than... show more
78,209 views • 2 days ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
Related Videos
671-billion-parameter DeepSeek-R1 model at up to 3,872 tokens per second
AK
227,579 views • 1 year ago
