
0xSero
@0xSero • 51,738 subscribers
Dad | Open Source | Back to Pleroma | ⵣ https://t.co/aSLDkVhImo
Shorts
Videos

I yapped about LLM Compression for 40 minutes, how much misinformation did i spread this time (,:
0xSero34,805 Aufrufe • vor 5 Tagen

Finally, full precision MiniMax-M2.7 running at home. 100 tokens/s decode 5050 tokens/s prefill
0xSero71,970 Aufrufe • vor 1 Monat

Local AI can be so good, but you’d need about 12k USD to get it. Then it’s not so great Here’s a Q4 of Qwen3.5-262B-REAP Weights 131GB KVCache 50GB 256,000 context 350 tokens/s prefill warmup 4,000 tokens/s prefill cache 36 tokens/s generation Vision enabled REAP is good
0xSero87,504 Aufrufe • vor 2 Monaten

I LOVE Deepseek-v4-flash, incredibly reliable and capable, logical. It's lacking in frontend but I have MiMo for that. I would recommend any company spending 100k+ a year on AI to purchase 8-10~ 6000s and have a few of the works to have them blind test these models for work.
0xSero47,272 Aufrufe • vor 1 Monat

Let me save you hours of testing frontends. If you're ever working on a front-end, instead of writing tests, and adding puppeteer slop to your repo 1. Get an llm to write you with whatever needs to be tested 2. Copy that, go to browser 3. Open localhost with your selected app 4. Use Claude Chrome Extension or Parchi 5. Send it the prompt 6. QA engineering, there you go. Use models results and pass it back to your coding agent to fix whatever is flagged.
0xSero109,697 Aufrufe • vor 4 Monaten

I figured out how to get Claude working anywhere without extra usage and technically in line with Anthropic's TOS. I don't recommend you adopt this, might get you banned but as you can see, no errors, no extra usage consumed. The point of this is to demonstrate futility.
0xSero51,550 Aufrufe • vor 1 Monat

For my friends with design taste, what do you think? Production ready?
0xSero14,472 Aufrufe • vor 19 Tagen