Running a single deep coding model at max context on Cerebras requires 24 systems ($24M CapEx) just to support 256 concurrent users. At that scale, $100M buys far more memory bandwidth in standard GB300 racks.
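To make the arithmetic behind that claim explicit, here is a rough sketch using only the figures quoted in the post (24 systems, $24M, 256 concurrent users). The per-system price of $1M is implied by those numbers rather than stated separately, and the function names are illustrative. No GB300 figures are included because the post gives none.

```python
# Back-of-the-envelope check of the post's Cerebras numbers.
# All inputs come from the post; nothing here is a vendor-published price.
SYSTEMS = 24
CAPEX_USD = 24_000_000
CONCURRENT_USERS = 256


def capex_per_system(capex_usd: float, systems: int) -> float:
    """Implied cost of one system."""
    return capex_usd / systems


def capex_per_user(capex_usd: float, users: int) -> float:
    """Capital cost attributable to each concurrent user."""
    return capex_usd / users


per_system = capex_per_system(CAPEX_USD, SYSTEMS)
per_user = capex_per_user(CAPEX_USD, CONCURRENT_USERS)
users_per_system = CONCURRENT_USERS / SYSTEMS

print(f"Implied cost per system:     ${per_system:,.0f}")
print(f"CapEx per concurrent user:   ${per_user:,.0f}")
print(f"Concurrent users per system: {users_per_system:.1f}")
```

Under these assumptions, each concurrent user carries roughly $94K of capital cost, and each system serves about 11 users.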
93,448 views • 21 days ago • via X (Twitter)
