Since the initial Attention is All you Need, 300 architectures have been contributed to Transformers. See the rise and fall of these architectures over time; crazy to see how BERT remains on top, but Llama is catching up fast!
24,585 次观看
ServiceNow-AI/Apriel-1.5-15b-Thinker running on a single GPU using `transformers serve` 🔥 great to have some very nice reasoning models that can run locally! next step, trying it on mps 👀
14,757 次观看
BOOOM! Both VLMs and LLMs now have a baked-in http server w/ OpenAI spec compatible API in transformers Launch it with `transformers serve` and connect your favorite apps. Here I'm running Open WebUI with local transformers. LLM, VLM, tool calling is in, STT & TTS coming soon!
16,256 次观看
The rise and fall of popular transformer architectures 📈 One pro of having a lot of transformer architectures implemented in transformers & hosted on the Hugging Face Hub is that one can see the evolution of popularity of different models 👀