Video wird geladen...
Video konnte nicht geladen werden
I trained a 12M parameter LLM on my own ML framework using a Rust backend and CUDA kernels for flash attention, AdamW, and more. Wrote the full transformer architecture, and BPE tokenizer from scratch. The framework features: - Custom CUDA kernels (Flash Attention, fused LayerNorm, fused GELU) for 3x... show more
808,841 Aufrufe • vor 1 Monat •via X (Twitter)
0 Kommentare
Keine Kommentare verfügbar
Kommentare vom Original-Post werden hier angezeigt
Ähnliche Videos
0:22
Sensitive content
This is my journey, I'm 4 years on E now and I changed a lot and I still want to change more for better, thank you for being with me these last 2 years that I've been on the platform, I want to make so much more for me and for you! Thank you and happy new year!
Amy Bunny
25,862 Aufrufe • vor 1 Jahr


