I trained a 12M-parameter LLM on my own ML framework, which uses a Rust backend and CUDA kernels for flash attention, AdamW, and more. I wrote the full transformer architecture and BPE tokenizer from scratch. The framework features:
- Custom CUDA kernels (Flash Attention, fused LayerNorm, fused GELU) for 3x...