
yazin
@yazins • 8,024 subscribers
messing with AI / founder @amalinvest (YC, W22)
Shorts
Videos

BREAKTHROUGH 85% accuracy at just 115MB (!) i almost can't believe it 📹 demo video below for context: i spent days on offline Quran recognition. Whisper, Moonshine, wav2vec2 fine-tunes, layer pruning, CTC rescoring, beam search tricks. nothing cracked 81% accuracy. i then plugged in NVIDIA's pretrained FastConformer and immediately got 90.7%. also, it's 10x faster 🐇 (0.33s vs 3.24s) and 10x smaller 👌 (115MB vs 1.2GB). i tried fine-tuning it on Quranic audio, figuring I could push it even higher. Ran four different configs. every single one came back worse than the stock pretrained model. no idea what's going on there yet. code/repo in next tweet
yazin62,151 görüntüleme • 3 ay önce
Daha fazla içerik yok.