正在加载视频...
视频加载失败
We open-sourced QeRL — Quantization-enhanced Reinforcement Learning ! 🧠 4-bit quantized RL training 💪 Train a 32B LLM on a single H100 GPU ⚙️ 1.7× faster overall training 🎯 Accuracy on par with bfloat16-level accuracy 🔥 Supports NVFP4 quantization format Moreover, we show that quantization helps exploration in RL... show more
69,720 次观看 • 7 个月前 •via X (Twitter)
0 条评论
暂无评论
原始帖子的评论将显示在这里
