
NEXA AI
@nexa_ai • 4,055 subscribers
On-device AI deployment and research | NexaSDK GitHub: https://t.co/N3ndWl4mqT | Hyperlink App: https://t.co/k4fKc4OjeJ
Shorts
Videos

Meet Hyperlink, the first AI super assistant that lives inside your computer. Your computer stores all your files and personal context. Hyperlink deeply understands them and gives cited answers instantly — like Perplexity for your local files. It turns your computer into a true AI OS, with 100% private on-device AI agent. Welcome to the new interface for personal intelligence.
NEXA AI2,136,912 Aufrufe • vor 7 Monaten

The next generation of mobile apps will run multimodal AI locally by default. Today, we’re making it practical for developers to ship. We just launched NexaSDK for Mobile on Product Hunt. Developers can run the latest multimodal AI models fully on-device in iOS & Android apps with Apple Neural Engine and Qualcomm Hexagon NPU acceleration. In just 3 lines of code, build chat, multimodal, search, and audio features with no cloud cost, complete privacy, 2x faster speed and 9× better energy efficiency. Try out NexaSDK for Mobile and we’d love your feedback (Product Hunt link in thread).
NEXA AI319,337 Aufrufe • vor 6 Monaten

Sam Altman recently said: “GPT-OSS has strong real-world performance comparable to o4-mini—and you can run it locally on your phone.” Many believed running a 20B-parameter model on mobile devices was still years away. At Nexa AI, we’ve built our foundation on deep on-device AI technology—turning that vision into reality. Today, GPT-OSS is running fully local on mobile devices through our app, Nexa Studio. Real performance on Snapdragon Gen 5: - 17 tokens/sec decoding speed - < 3 seconds Time-to-First-Token Developers can now use NexaSDK to build their own local AI apps powered by GPT-OSS. What this unlocks: - Real reasoning models running locally - The next wave of AI Agent use cases—without cloud limits - True data privacy: your data never leaves your device Appreciation to OpenAI Sam Altman Greg Brockman and the entire leadership team for pushing the boundaries of the open-source AI community. Thanks to Qualcomm for the partnership, and to Manoj Khilnani and Chun-Po (Jerry) Chang for the incredible collaboration.
NEXA AI197,453 Aufrufe • vor 8 Monaten

Matthew McConaughey’s dream private LLM? Already exists. It’s called Hyperlink - an offline, private AI agent that knows every file you own. Search files and discover buried insights with it. AI runs 100% local. Hey Matthew, want to try it? Anyone can set up in minutes.
NEXA AI140,797 Aufrufe • vor 8 Monaten

AI PCs are here - yet we still lose 500+hrs/yr chasing scattered files and buried insights. Cloud AI risks your sensitive data. On device AI doesn't. Meet Hyperlink - the fully offline AI agent that instantly searches local folders and unlocks missed ideas with in-text citations.🧠 🚀Public beta available now ↓ (use cases + link below)
NEXA AI130,926 Aufrufe • vor 10 Monaten

For the first time, the latest LLMs run on the Apple Neural Engine — and NexaSDK is the only framework that makes it possible, powered by the NexaML engine. Last year, our two co-founders were invited by Apple DMLI team (Data & Machine Learning Innovation) to share their research about on-device multimodal model for local AI agents. One of the big questions in the room was: “Can the newest LLMs actually run on ANE?” At the time, nobody had a clear path. Today, that path exists. NexaSDK now runs Granite-4.0 (IBM), Qwen3 (Qwen), Gemma3 (Google), and Parakeet-v3 (NVIDIA) fully on Apple’s NPU — unlocking low-power, always-on, fast inference across Mac and iPhone. A new wave of NPU-first local AI apps is coming to Apple devices. Start with one line of code on Mac. iOS SDK coming soon.
NEXA AI30,213 Aufrufe • vor 7 Monaten

Introducing NexaSDK for Android (Beta) — run the latest AI models locally, 9× more energy-efficient and 2× faster, on Android devices, powered by the Qualcomm Hexagon NPU. This is the first SDK to support NPU, GPU and CPU, unlocking the full power of every Android device — for example, LFM2-1.2B achieves 85 t/s on NPU vs 37 t/s on CPU. With just 3 lines of code, you can run the latest state-of-the-art models across every task, for example: Multimodal (Vision, Audio, Text): OmniNeural-4B Embedding: EmbeddingGemma from Google ASR: Parakeet-v3 from NVIDIA OCR: PaddleOCR from Baidu Inc. Rerank: Jina-reranker from Jina AI LLM: LFM2-1.2B from And we continue to deliver Day-0 model support across our framework. Build on-device AI into your Android app today and enjoy no cloud API cost, full privacy, and offline availability. Check out our Quickstart guide and example app below to get started in minutes.
NEXA AI26,091 Aufrufe • vor 7 Monaten
Keine weiteren Inhalte verfügbar