
0xSero
@0xSero • 51,738 subscribers
Dad | Open Source | Back to Pleroma | ⵣ https://t.co/aSLDkVhImo

Finally, full-precision MiniMax-M2.7 running at home: 100 tokens/s decode, 5050 tokens/s prefill.
0xSero • 71,970 views • 1 month ago

Local AI can be so good, but you'd need about 12k USD to get it. Then it's not so great. Here's a Q4 of Qwen3.5-262B-REAP: weights 131 GB, KV cache 50 GB, 256,000 context, 350 tokens/s prefill (warmup), 4,000 tokens/s prefill (cache), 36 tokens/s generation, vision enabled. REAP is good.
0xSero • 87,504 views • 2 months ago

I LOVE Deepseek-v4-flash: incredibly reliable, capable, and logical. It's lacking in frontend, but I have MiMo for that. I would recommend that any company spending 100k+ a year on AI purchase roughly 8-10 6000s and have a few of its workers blind test these models for work.
0xSero • 47,272 views • 1 month ago

Let me save you hours of testing frontends. If you're ever working on a front-end, instead of writing tests and adding Puppeteer slop to your repo:
1. Get an LLM to write you a prompt covering whatever needs to be tested.
2. Copy that and go to your browser.
3. Open localhost with your selected app.
4. Use the Claude Chrome extension or Parchi.
5. Send it the prompt.
6. QA engineering, there you go.
Take the model's results and pass them back to your coding agent to fix whatever is flagged.
0xSero • 109,697 views • 4 months ago

I figured out how to get Claude working anywhere without extra usage and technically in line with Anthropic's TOS. I don't recommend you adopt this, since it might get you banned, but as you can see: no errors, no extra usage consumed. The point of this is to demonstrate futility.
0xSero • 51,550 views • 1 month ago

For my friends with design taste, what do you think? Production ready?
0xSero • 14,472 views • 19 days ago