
Red Hat AI
@RedHat_AI • 11,002 subscribers
Accelerating AI innovation with open platforms and community. The future of AI is open.
Videos

Michael Goin (Michael Goin) walks through what's new in vLLM v0.17, v0.18, and v0.19 in ~8 minutes. Flash Attention 4, new performance modes, zero-bubble async scheduling, online MXFP4 quantization, Gemma 4, and a lot more. 1,592 commits. 682 contributors (163 new). 🎉 🚀
Red Hat AI22,983 views • 2 months ago

A full year of vLLM in 30 minutes by vLLM Lead from UC Berkeley, Simon Mo. Model and hardware usage trends, model architectures, API evolution, V1 engine rebuild, multimodal progress, expanding hardware support, and more. Plus how we are thinking about 2026. Enjoy!
Red Hat AI15,697 views • 5 months ago
No more content to load