
QVAC
@qvac • 7,788 subscribers
Infinite intelligence. Local. Any Hardware. Peer-to-Peer Hyper Swarm. No cloud. No compromise. QVAC is the decentralized AI platform for humans and machines.
Shorts
Videos

Yesterday we announced that the QVAC SDK update unlocked up to 5x more context on your device thanks to TurboQuant. Today, we’ll go through how we got there. TurboQuant (Google Research, ICLR 2026) is a two-stage KV-cache compression algorithm. Stage 1 - PolarQuant: convert KV vectors from Cartesian (x, y, z...) to polar coordinates. Angles compress predictably down to 3-4 bits. Stage 2 - QJL: 1-bit Johnson-Lindenstrauss correction. Cleans up residual error. Total: ~4-5 bits per value. No retraining. No calibration. QVAC ported it to Vulkan inside qvac-fabric-llm.cpp. Currently, TurboQuant is supported only for AMD & NVIDIA GPUs, support for iOS, Android & Apple Silicon coming next. Full algorithm walkthrough + benchmarks + code examples →
QVAC9,652,460 views • 3 days ago

QVAC SDK 0.12.0 is now live, bringing longer context, increased memory optimisation, new modalities, and broader ecosystem support directly to your device. Key Features and Updates: - TurboQuant KV-Cache Quantization: Fit much longer context in the same memory. TurboQuant, an algorithm from Google Research, compresses the KV cache by up to 5x, near-lossless. - Text-to-Video: Generate video from a text prompt, fully local, with the new wan2.1 model in the Diffusion addon - Apple Metal Performance for Flux2-klein: Diffusion on Apple Silicon now matches MLX performance, the native benchmark for Apple GPUs - Robot Control (new VLA addon): A GGML-based Vision-Language-Action addon brings fast, efficient robot control to edge devices - Coding Assistant / Harness Support: QVAC now works with OpenCode and OpenClaw as a local provider. A new @qvac/ai-sdk-provider package automates model registry and provider integration - Cross-Platform Voice: Text-to-speech and Parakeet transcription moved from ONNX to the GGML engine for better CPU and GPU support on macOS, iOS, Windows, Linux, and Android. Parakeet also adds long-term streaming diarization (tracking who spoke when on live audio) - Faster Lightweight Visual Classification: A new GGML-based Classification addon delivers millisecond-level classification, useful where a vision-language model (VLM) would be unnecessarily slow - Under the Hood: Fabric synced to llama.cpp v8828 (from v8189), plus GPU acceleration added to image-upscale models for faster results Full release notes:
QVAC9,849,057 views • 4 days ago

If your AI needs a wire to work, it isn't yours. QVAC is the local-first engine designed to run anywhere, even where the internet can't reach. Fully autonomous. Fully open source. Fully sovereign. If you can dream it, you can build it. Even in another galaxy. Start building the future of edge AI:
QVAC7,043,171 views • 1 month ago

Intelligence should not be a service you rent; it is a foundational element you possess. At Tether, we see AI as a new element of the periodic table - a raw material that can be embedded into the very fabric of the universe. Today, the QVAC SDK is officially live - the atomic unit for the next era of compute. From your smartphone today to the edge of the galaxy tomorrow, we are building the decentralized mind that doesn't require an uplink to function. Infinite Stable Intelligence: - Local-First: Runs privately on any device without permission or central servers. - Single API: A complete SDK for Vision, RAG, P2P networking, and LLM fine-tuning. - Unstoppable: No central point of failure if the internet breaks, your world keeps thinking. - Decentralized: Evolve through Peer-to-Peer Swarms of Infinite Intelligence. The era of Stable Intelligence has begun. Start building the future at
QVAC13,333,357 views • 1 month ago

The engine of the 21st century is here. 🧠 The QVAC SDK is the "steam engine" of the AI era—decoupling intelligence from the cloud and putting it in your hands. A single API for local-first, modular AI that runs anywhere. - Sovereign: Own your engine, don't rent it. - Local: 0 latency, no cloud dependency. - Modular: Stackable, universal building blocks. The era of Stable Intelligence has begun.
QVAC10,662,874 views • 1 month ago

The world of tomorrow cannot run on a rented cloud. 🚫 With 10 billion humans and 10 billion autonomous agents, intelligence must be embedded at the edge - not centralized in a server farm. The QVAC SDK is the invisible engine for this transition. We’ve built the foundational toolkit for the next era: highly efficient, fully modular, and 100% sovereign. From a single light to an industrial grid, the power to build local-first AI is now in your hands. The revolution will not be hosted. It will be local. Learn more:
QVAC2,908,291 views • 21 days ago

QVAC SDK 0.11.0 is live. 🛠️ This release focuses entirely on unlocking next-generation local compute and advanced visual workflows. What’s new: Next-Gen Models: Core engine updated to the latest version of Fabric, unlocking full support for Qwen 3.5, Qwen 3.6, and Gemma 4. Multi-GPU Support: The SDK can now split workloads across multiple graphics cards on the same machine, allowing you to run significantly larger models completely locally. Multi-Image Conditioning: Blend multiple reference images together in a single generation for advanced style mixing and composition control. On-Device Upscaling: Boost your generated images to high-quality resolutions, running securely on your own hardware. More improvements are waiting under the hood. Check the change logs, update your SDK today, and start building with
QVAC2,005,549 views • 18 days ago

The QVAC SDK is the "LEGO block" of the next era of computing. It’s a modular, local-first framework designed to turn anything—from a simple robot to an industrial server—into a sovereign, autonomous mind. Why build with QVAC? Atomic Intelligence: AI as a raw material embedded directly into your hardware. No Cloud Dependency: 0 latency and total privacy. If the internet breaks, your world keeps thinking. Infinite Scale: A single API for local AI that runs on any device, anywhere. From a child’s toy to the fabric of the universe, if you can dream it, you can build it. Start building the future: 🚀
QVAC4,706,897 views • 1 month ago

Superior methodology beats raw parameter count. 🧠 Introducing QVAC MedPsy: Local-first medical AI that redefines the possible. 1/ Unprecedented Power: MedPsy 1.7B model outperforms Google’s MedGemma 4B by 11 points and our 4B model beats MedGemma 27B on real-world health benchmarks. 2/ Extreme Efficiency: 3.2x fewer tokens means near-instant inference on your phone or wearable. 3/ Absolute Privacy: Expert-level reasoning running 100% locally. No data leaves your device. We aren’t simply shrinking models; we’re anchoring intelligence where it matters most. High-level medical logic is now a sovereign right. The future of healthcare is local. Learn more:
QVAC2,415,920 views • 29 days ago

Ready to build the future of stable private on-device AI? 🧠 Our latest tutorial shows you how to build a sovereign mobile app in minutes using the QVAC SDK and Expo. Start from a blank template and deploy in minutes a local Llama 3.2 inference running directly on your own devices. What you’ll learn: Modular Setup: Use the QVAC CLI to tree-shake and keep your mobile bundle lean. Local-First Flow: Initialize the SDK, download weights, and run high-speed inference without a cloud uplink. Cross-Platform Power: See the smoke test in action on a physical Samsung S25. No rented clouds. No API keys. Build local, on-device, unstoppable intelligence in your pocket. Watch the full guide and start building:
QVAC4,080,244 views • 1 month ago

Two islands. Two futures. 🏝️ One chose to trust its people with intelligence. The other turned them into the product. QVAC is the foundation for a sovereign future. No central servers, no "Department of Truth," and no surveillance. Just local-first AI that lives on your device, learns with you, and belongs to you. Your data. Your device. Your freedom. Build the right choice:
QVAC3,599,437 views • 1 month ago

Big Tech's "safe spaces" are a mirage. When your most private thoughts are routed through a server, they are never truly yours; they are flagged, reviewed, and vulnerable. QVAC is the alternative. It’s not just a chatbot; it’s a sovereign intelligence platform that lives entirely on your devices. Truly Private: Data never leaves your hardware. No Gatekeepers: No "Department of Truth," no central surveillance. Unstoppable: Local AI that works for you, and only you. Take back your privacy. Take back your secrets. The era of Stable Intelligence has begun. Build the future:
QVAC2,967,319 views • 1 month ago

The era of Stable Intelligence is here 🤖 Tether’s QVAC Fabric just released the world’s first cross-platform 1-bit LLM LoRA fine-tuning framework. QVAC Fabric extends Microsoft's ultra-efficient BitNet architecture, allowing fine-tuning and inference of LLMs directly on your smartphone—no NVIDIA GPU/CUDA lock-in or expensive server required. The Breakthrough: - Total Sovereignty: LoRA fine-tune ultra-efficient models locally on any smartphone, including iPhones, Pixel phones, Samsung Galaxy phones and any desktop/laptop operating systems using Vulkan and Metal backends. - Extreme Efficiency: 1-bit architecture uses up to 90% less memory and runs up to 11x faster than traditional models. - Universal Access: What used to require a data center now runs on the chip in your pocket. Own your intelligence. The era of stable, local AI is here. 📱🧠 Read the full details on Hugging Face and grab the binaries to build on your own hardware.
QVAC108,379 views • 2 months ago

QVAC SDK 0.10.0 is now live, bringing advanced local compute capabilities and specialized hardware optimization directly to your device Key Features and Updates: - Image-to-Image Diffusion: Transform and edit images using simple prompts with 100% local compute—no cloud uploads or external servers required - Dynamic Tooling & KV Cache Management:Your local LLM now receives a tailored toolbox for every interaction, with automatic KV cache clearing to maintain high-speed inference - Doctor CLI: A new diagnostic tool that analyzes your hardware and memory, providing specific instructions on how to optimize your GPU for local AI - Suspend & Resume API: Specifically designed for mobile environments, this allows apps to pause P2P swarms and RAG workspaces to meet background rules without losing model state - GPT-OSS Compatibility: Added support for the latest GPT-OSS models loaded externally, expanding the range of open-source intelligence available on the platform Build the future of private, unstoppable AI:
QVAC34,043 views • 1 month ago

The QVAC SDK puts the "brain" directly into your pocket. From real-time on-device translation to multimodal understanding, build apps that work everywhere, even 30,000 feet in the air. Local AI is here: 💡Offline-First: No cloud, no latency, no "Department of Truth". 💻 Universal API: One codebase for iOS, Android, macOS, and Linux. 🔍 Multimodal: Understanding text, audio, and images without a server. If you can dream it, you can build it. The era of Stable Intelligence has begun. Start building:
QVAC36,352 views • 1 month ago

Introducing QVAC Health, the all-in-one wellness platform that aggregates your health data from various sources in one, encrypted, offline-capable environment. Say goodbye to fragmented data and hello to a unified, AI-powered wellness timeline! QVAC - Your Device, Your AI
QVAC92,842 views • 5 months ago

Meet Genesis V2. We’ve introduced "Option-Level Reasoning"—a method that trains models to understand why distractors are incorrect, not just pick the right answer. 📈 The Result: ~30% avg accuracy (competitors ~12%). ✅ Reliability: 99.4% valid, clear answers. Read the paper & get the dataset on Hugging Face:
QVAC85,228 views • 5 months ago

QVAC Workbench 0.6.0 is officially live. 🤖 This update marks a major shift toward a more natural, hands-free interface with several key features: Conversation Mode: Enables full voice-to-voice interaction using automated transcription and text-to-speech (TTS), allowing the keyboard to be entirely optional. Automated Model Selection: Streamlines the user experience by intelligently selecting the best local model for your device. Snappier UI: Significant performance improvements to the interface for a faster and more responsive workspace. Learn more:
QVAC24,056 views • 1 month ago

QVAC Health 1.1.0 is officially live! 🏥✨ Your wellness data belongs to you, not the cloud. This latest update, powered by the upgraded QVAC SDK 0.8.0, brings significant performance gains and local-first features to your sovereign health dashboard. What’s New: Calorie Tracking: Log meals and monitor intake directly on-device. Advanced Biomarkers: Weight tracking now includes automatic BMI calculations. Improved Vitals: Organized dashboard and critical fixes for Apple Watch blood oxygen data. Total Privacy: Faster performance with 100% local, encrypted processing. Update today and experience health insights without the surveillance. Build the future:
QVAC20,158 views • 1 month ago