正在加载视频...
视频加载失败
Running a single deep coding model at max context on Cerebras requires 24 systems ($24M Capex) just to support 256 concurrent users. At that scale, $100M gets you way more memory bandwidth in standard GB300 racks.
0 条评论
暂无评论
原始帖子的评论将显示在这里
