Video yükleniyor...
Video Yüklenemedi
I trained a 12M parameter LLM on my own ML framework using a Rust backend and CUDA kernels for flash attention, AdamW, and more. Wrote the full transformer architecture, and BPE tokenizer from scratch. The framework features: - Custom CUDA kernels (Flash Attention, fused LayerNorm, fused GELU) for 3x... show more
809,167 görüntüleme • 1 ay önce •via X (Twitter)
0 Yorum
Yorum bulunmuyor
Orijinal gönderinin yorumları burada görünecek
Benzer Videolar
0:22
Sensitive content
This is my journey, I'm 4 years on E now and I changed a lot and I still want to change more for better, thank you for being with me these last 2 years that I've been on the platform, I want to make so much more for me and for you! Thank you and happy new year!
Amy Bunny
25,862 görüntüleme • 1 yıl önce


