
pradeep
@pradeep24 • 1,434 subscribers
co-founder @askjoai // oss: camofox // past lives: full-stack grocery, ridesharing, fortune 500, photos, lots of code.

tested out antirez' ds4.c this morning. so impressive and delivers.

on an M3 Max, 128GB, stock ds4 settings:
- 14–15 t/s at 62K pre-filled actual coding conversation
- memory usage was flat during gen, ~85GB res
- disk cache is ~8GB for a full 100K context window
- thermals were normal, light fan activity
- inference server is rock solid so far

biggest constraint: anytime there's a compact, we pay the wait-time price of a fresh prefill (~1min per 10k context) before we are back in action.

performance with sequential inference + multiple agents in parallel is unclear, will report back.

I'm so amped.
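to put the compact cost in perspective, here is a rough back-of-envelope sketch using only the numbers above. big assumptions: prefill time and disk cache scale linearly with context length, and 14.5 t/s (the midpoint of 14–15, measured at 62K) holds at other context sizes. the function names are made up for illustration and are not part of ds4.

```python
# Back-of-envelope estimates from the figures in the post.
# Assumption: everything scales linearly with token count (not measured).

PREFILL_SECONDS_PER_10K = 60.0   # "~1min per 10k context"
GEN_TOKENS_PER_SECOND = 14.5     # assumed midpoint of "14–15 t/s"
DISK_CACHE_GB_PER_100K = 8.0     # "~8GB for a full 100K context window"


def prefill_wait_seconds(context_tokens: int) -> float:
    """Estimated wait for a fresh prefill of `context_tokens` tokens."""
    return context_tokens / 10_000 * PREFILL_SECONDS_PER_10K


def generation_seconds(output_tokens: int) -> float:
    """Estimated time to generate `output_tokens` tokens."""
    return output_tokens / GEN_TOKENS_PER_SECOND


def disk_cache_gb(context_tokens: int) -> float:
    """Estimated disk cache size for `context_tokens` tokens of context."""
    return context_tokens / 100_000 * DISK_CACHE_GB_PER_100K


if __name__ == "__main__":
    for ctx in (10_000, 30_000, 62_000, 100_000):
        wait_min = prefill_wait_seconds(ctx) / 60
        print(f"{ctx:>7} tokens: ~{wait_min:.1f} min prefill, "
              f"~{disk_cache_gb(ctx):.1f} GB disk cache")
```

under those assumptions, a compact that leaves 30K tokens of context costs roughly 3 minutes of prefill before generation resumes, which is about the time it takes to generate ~2,600 tokens at 14.5 t/s.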
pradeep • 162,212 views • 1 month ago