
antirez
@antirez • 64,283 subscribers
Reproducible bugs are candies. I like programming too much for not liking automatic programming.
Videos

DeepSeek v4 PRO running via SSD streaming on my 128GB MacBook M5 Max. 1.6 trillion parameters.
antirez • 258,651 views • 2 days ago

I didn't expect DeepSeek v4 PRO (not Flash) to run well on the Mac Studio M3 Ultra with 512GB of RAM. This is 2-bit quantized with the same DwarfStar recipe used for Flash. 433GB GGUF file. 130 t/s prefill, 13 t/s generation. Prefill in the video is low because the prompt is small.
antirez • 168,072 views • 20 days ago
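
As a rough sanity check on the figures quoted above, the average number of bits stored per weight can be estimated from the file size and the parameter count. This is only a sketch: it assumes "433GB" means decimal gigabytes, that the 1.6 trillion parameter count applies to this same file, and it ignores metadata and any tensors kept at higher precision. The function name is made up for illustration.

```python
# Back-of-the-envelope estimate: average bits per weight implied by
# a model file size and a parameter count.

def bits_per_weight(file_bytes: float, n_params: float) -> float:
    """Average bits stored per parameter (metadata overhead ignored)."""
    return file_bytes * 8 / n_params

GGUF_BYTES = 433e9   # 433GB GGUF file, assuming decimal gigabytes
N_PARAMS = 1.6e12    # 1.6 trillion parameters

bpw = bits_per_weight(GGUF_BYTES, N_PARAMS)
print(f"about {bpw:.1f} bits per weight on average")
```

The result lands a little above 2 bits, which is consistent with a 2-bit quantization where some tensors are likely kept at higher precision.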

DS4 running on DGX Spark (GB10 / CUDA), private branch for now. 12 tokens/sec; memory bandwidth is limited on this system, at 270GB/sec. But prefill is way more aligned with the M3 Max at ~200 t/s. I'll release it when it is more mature, but it is almost certain that it will get merged.
antirez • 83,314 views • 27 days ago
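
The link between memory bandwidth and generation speed can be sketched with simple arithmetic. Assuming token generation is purely memory-bandwidth-bound and the system reaches its full 270GB/sec (both are simplifying assumptions, and real throughput is usually lower than the peak), the observed 12 tokens/sec puts an upper bound on how many bytes are read from memory per generated token. The function name is hypothetical.

```python
# Upper bound on bytes read per generated token, assuming decoding is
# limited only by memory bandwidth and runs at peak bandwidth.

def max_bytes_per_token(bandwidth_bytes_per_s: float, tokens_per_s: float) -> float:
    """Bytes that can be streamed from memory in the time of one token."""
    return bandwidth_bytes_per_s / tokens_per_s

BANDWIDTH = 270e9      # 270GB/sec, assuming decimal gigabytes
TOKENS_PER_SEC = 12    # generation speed quoted above

gb_per_token = max_bytes_per_token(BANDWIDTH, TOKENS_PER_SEC) / 1e9
print(f"at most {gb_per_token:.1f} GB read per token")
```

Under these assumptions the bound is 22.5 GB per token. Since achieved bandwidth is normally below the peak, the actual amount read per token would be smaller.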

Much better at 17 tokens/sec on the M3 Max, after fusing a few operations. 128GB M3 Max.
antirez • 24,817 views • 1 month ago