
antirez
@antirez • 64,283 subscribers
Reproducible bugs are candies. I like programming too much for not liking automatic programming.

DeepSeek v4 PRO running via SSD streaming on my 128GB MacBook M5 Max. 1.6 trillion parameters.
antirez • 258,651 views • 2 days ago

I didn't expect DeepSeek v4 PRO (not Flash) to run well on the Mac Studio M3 Ultra with 512GB of RAM. This is 2-bit quantized with the same DwarfStar recipe used for Flash. 433GB GGUF file. 130 t/s prefill, 13 t/s generation. Prefill in the video is low because the prompt is small.
antirez • 168,072 views • 20 days ago

DS4 running on DGX Spark (GB10 / CUDA), private branch for now. 12 tokens/sec: the memory bandwidth is limited on this system, at 270GB/sec. But prefill is much more in line with the M3 Max, at ~200 t/s. I'll release it when it is more mature, but it is almost certain that it will get merged.
antirez • 83,314 views • 27 days ago

The first M5 Max arrived! Many, many thanks to our sponsors ⿻ Audrey Tang 唐鳳 and Niels Gron. The next will arrive on Monday.
antirez • 27,710 views • 15 days ago

Much better at 17 tokens/sec on M3 Max, after fusing a few operations. 128GB M3 Max.
antirez • 24,817 views • 1 month ago