argmax's banner

argmax

@argmax • 4,490 subscribers

Frontier Models On Device

Shorts

We are open-sourcing TTSKit! Run state-of-the-art text-to-speech models on your Mac and iPhone. The launch version supports Qwen Qwen3-TTS and generates audio faster than real-time playback with sub-200 ms time-to-first-byte. Voice cloning and advanced speed optimizations will be in the next version. Link to the GitHub repo and models on Hugging Face in comments.

We are open-sourcing TTSKit! Run state-of-the-art text-to-speech models on your Mac and iPhone. The launch version supports Qwen Qwen3-TTS and generates audio faster than real-time playback with sub-200 ms time-to-first-byte. Voice cloning and advanced speed optimizations will be in the next version. Link to the GitHub repo and models on Hugging Face in comments.

61,998 views

WhisperKit-0.7.0 is out! Single file inference is several times faster! The demo below is running distil-whisper large-v3 at 300 tok/s and transcribes 101 seconds of audio in 1 second on an M2 Ultra Mac Studio. Details 🧵 Code (MIT): Demo Audio Input: Demo App: (TestFlight update under review)

WhisperKit-0.7.0 is out! Single file inference is several times faster! The demo below is running distil-whisper large-v3 at 300 tok/s and transcribes 101 seconds of audio in 1 second on an M2 Ultra Mac Studio. Details 🧵 Code (MIT): Demo Audio Input: Demo App: (TestFlight update under review)

50,549 views

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

Introducing Real-time Transcription with Speakers! - Step change in accuracy, surpassing top cloud APIs - Faster than real-time on Mac and iPhone - Still under 3 watts when all features are enabled Available in Argmax SDK 2.0 for early access! Benchmarks and details in comments.

Introducing Real-time Transcription with Speakers! - Step change in accuracy, surpassing top cloud APIs - Faster than real-time on Mac and iPhone - Still under 3 watts when all features are enabled Available in Argmax SDK 2.0 for early access! Benchmarks and details in comments.

72,885 views • 7 months ago

Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator! Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.

Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator! Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.

27,205 views • 2 months ago

TTSKit now achieves sub-100ms time-to-first-byte for Qwen3-TTS 1.7b on Apple Silicon! Link to the code repo and details in comments.

TTSKit now achieves sub-100ms time-to-first-byte for Qwen3-TTS 1.7b on Apple Silicon! Link to the code repo and details in comments.

30,154 views • 4 months ago

Introducing Real-time Transcription with Nvidia Parakeet - Same top accuracy as file transcription - Best-in-market 160 ms lips-to-screen latency - 744x more cost-efficient compared to cloud APIs - Available in Argmax Pro SDK starting today! Link in comments

Introducing Real-time Transcription with Nvidia Parakeet - Same top accuracy as file transcription - Best-in-market 160 ms lips-to-screen latency - 744x more cost-efficient compared to cloud APIs - Available in Argmax Pro SDK starting today! Link in comments

58,986 views • 1 year ago

Real-time Transcription with Speakers is now generally available on iOS and macOS! Details for installing or simply testing Argmax SDK 2 are in the comments.

Real-time Transcription with Speakers is now generally available on iOS and macOS! Details for installing or simply testing Argmax SDK 2 are in the comments.

29,142 views • 4 months ago

Introducing WhisperKit

Introducing WhisperKit

98,413 views • 2 years ago

DiffusionKit now supports Stable Diffusion 3 Medium MLX Python and Core ML Swift Inference work great for on-device inference on Mac! MLX: Core ML: Mac App: Hugging Face Diffusers App (Pending App Store review)

DiffusionKit now supports Stable Diffusion 3 Medium MLX Python and Core ML Swift Inference work great for on-device inference on Mac! MLX: Core ML: Mac App: Hugging Face Diffusers App (Pending App Store review)

74,009 views • 2 years ago

Introducing Argmax Local Server Run our state-of-the-art real-time transcription server directly on Mac! 0:31 Feature complete for AI Meeting Notes apps 0:49 Migrate from cloud APIs with 1 line of code 1:05 Fastest speech models with top accuracy 1:31 Other apps do not slow down Available starting today with Python, JavaScript, and Rust clients! Details in comments

Introducing Argmax Local Server Run our state-of-the-art real-time transcription server directly on Mac! 0:31 Feature complete for AI Meeting Notes apps 0:49 Migrate from cloud APIs with 1 line of code 1:05 Fastest speech models with top accuracy 1:31 Other apps do not slow down Available starting today with Python, JavaScript, and Rust clients! Details in comments

13,003 views • 11 months ago

No more content to load