
Xenova
@xenovacom • 17,207 subscribers
Bringing the power of machine learning to the web. Currently working on Transformers.js (@huggingface 🤗)
Shorts
Videos

NEW: OpenAI releases Privacy Filter, their first open model of 2026! 🤗 Apache-2.0! It's a bidirectional token-classification adaptation of GPT-OSS, trained to mask personally identifiable information (PII) in text. At only 1.5B params, it can even run locally in your browser!
Xenova219,173 次观看 • 1 个月前

Behold... GPT-OSS (20B) running 100% locally in your browser on WebGPU. This shouldn't be possible — but with Transformers.js v4 and ONNX Runtime Web, it is! A new class of AI apps is emerging. Zero-install, infinite distribution. Simply visit a website and run models locally.
Xenova311,285 次观看 • 3 个月前

Introducing Voxtral WebGPU: Real-time speech transcription entirely in your browser. This demo runs Voxtral-Mini-4B, a powerful streaming ASR model from Mistral AI, locally on WebGPU. The model supports 13 languages and is capable of <500 ms latency. Fully private. Zero cost.
Xenova93,558 次观看 • 2 个月前

RF-DETR, the state-of-the-art model series for real-time object detection, can now run 100% locally in your browser on WebGPU with 🤗 Transformers.js v4! The models are Apache-2.0 licensed, making them a perfect fit for both personal and commercial applications. Try the demo 👇
Xenova76,917 次观看 • 3 个月前

Not enough people are talking about NVIDIA's new Nemotron-3-Nano (4B) model! 🤯 Hybrid Mamba + Attention architecture, designed as a unified model for reasoning and non-reasoning tasks. So small and efficient, it can run 100% locally in your web browser at 75 tokens per second.
Xenova50,063 次观看 • 2 个月前

BOOM! 💥 Today I added WebGPU support for Andrej Karpathy's nanochat models, meaning they can run 100% locally in your browser (no server)! The d32 version runs at over 50 tps on my M4 Max 🚀 Pretty wild that you can now deploy AI applications using just a single index.html file 😅
Xenova95,096 次观看 • 7 个月前

IBM just released Granite 4.0, their latest series of small language models! These models excel at agentic workflows (tool calling), document analysis, RAG, and more. 🚀 The "Micro" (3.4B) model can even run 100% locally in your browser on WebGPU, powered by 🤗 Transformers.js!
Xenova82,762 次观看 • 8 个月前

NEW: Google releases FunctionGemma, a lightweight (270M), open foundation model built for creating specialized function calling models! 🤯 To test it out, I built a small game: use natural language to solve fun physics simulation puzzles, running 100% locally in your browser! 🕹️
Xenova58,113 次观看 • 5 个月前