正在加载视频...

视频加载失败

加载此视频时出现问题。这可能是由于临时网络问题，或视频可能不可用。

Introducing Real-time Transcription with Speakers! - Step change in accuracy, surpassing top cloud APIs - Faster than real-time on Mac and iPhone - Still under 3 watts when all features are enabled Available in Argmax SDK 2.0 for early access! Benchmarks and details in comments.

argmax

4,491 subscribers

72,819 次观看 • 6 个月前 •via X (Twitter)

新闻政治教育科学技术

Anya Rossi• Live Now

Private livecam show

0 条评论

暂无评论

原始帖子的评论将显示在这里

相关视频

Introducing Real-time Transcription with Nvidia Parakeet - Same top accuracy as file transcription - Best-in-market 160 ms lips-to-screen latency - 744x more cost-efficient compared to cloud APIs - Available in Argmax Pro SDK starting today! Link in comments

Introducing Real-time Transcription with Nvidia Parakeet - Same top accuracy as file transcription - Best-in-market 160 ms lips-to-screen latency - 744x more cost-efficient compared to cloud APIs - Available in Argmax Pro SDK starting today! Link in comments

argmax

58,986 次观看 • 11 个月前

Real-time Transcription with Speakers is now generally available on iOS and macOS! Details for installing or simply testing Argmax SDK 2 are in the comments.

Real-time Transcription with Speakers is now generally available on iOS and macOS! Details for installing or simply testing Argmax SDK 2 are in the comments.

argmax

29,142 次观看 • 4 个月前

Introducing Argmax Local Server Run our state-of-the-art real-time transcription server directly on Mac! 0:31 Feature complete for AI Meeting Notes apps 0:49 Migrate from cloud APIs with 1 line of code 1:05 Fastest speech models with top accuracy 1:31 Other apps do not slow down Available starting today with Python, JavaScript, and Rust clients! Details in comments

Introducing Argmax Local Server Run our state-of-the-art real-time transcription server directly on Mac! 0:31 Feature complete for AI Meeting Notes apps 0:49 Migrate from cloud APIs with 1 line of code 1:05 Fastest speech models with top accuracy 1:31 Other apps do not slow down Available starting today with Python, JavaScript, and Rust clients! Details in comments

argmax

13,003 次观看 • 10 个月前

TTSKit now achieves sub-100ms time-to-first-byte for Qwen3-TTS 1.7b on Apple Silicon! Link to the code repo and details in comments.

TTSKit now achieves sub-100ms time-to-first-byte for Qwen3-TTS 1.7b on Apple Silicon! Link to the code repo and details in comments.

argmax

30,154 次观看 • 4 个月前

Introducing the fastest inpainting ever — Freepik Retouch. Remove, edit, and adjust details of images in real time. Available now for all Premium users and AI Partners.

Introducing the fastest inpainting ever — Freepik Retouch. Remove, edit, and adjust details of images in real time. Available now for all Premium users and AI Partners.

Magnific

84,712 次观看 • 2 年前

ONTbarcoder 2.0: Real-time DNA barcoding with MinION´s R10.4. Now you can find out what species are in your sample, while you are still sequencing. Get more and better barcodes faster than ever.

ONTbarcoder 2.0: Real-time DNA barcoding with MinION´s R10.4. Now you can find out what species are in your sample, while you are still sequencing. Get more and better barcodes faster than ever.

Rudolf Meier @rudolf-meier.bsky.soci

23,177 次观看 • 3 年前

Introducing in collaboration with Google Cloud For the first time agents can discover, access, and pay-per-request for APIs from Google Cloud including Gemini, BigQuery, Vertex AI, and more using stablecoins on Solana. No accounts, no subscriptions, just machine-native commerce.

Introducing in collaboration with Google Cloud For the first time agents can discover, access, and pay-per-request for APIs from Google Cloud including Gemini, BigQuery, Vertex AI, and more using stablecoins on Solana. No accounts, no subscriptions, just machine-native commerce.

Solana Foundation

618,477 次观看 • 1 个月前

Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator! Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.

Argmax now runs on Google Tensor TPU, the first-ever SDK to harness this edge inference accelerator! Tensor TPU enabled us to deploy billion-scale transformers reliably on Pixel phones without impacting battery life or resource contention with traditional workloads.

argmax

26,957 次观看 • 1 个月前

DNA to RNA real-time speed. Gene Transcription at real-time speed. Transcription is the first step in gene expression. Credit: @drewberryIV &

DNA to RNA real-time speed. Gene Transcription at real-time speed. Transcription is the first step in gene expression. Credit: @drewberryIV &

The Innovation | Medicine

351,704 次观看 • 3 年前

Introducing Swipooor by REDACTED A swipe-powered platform, launching on Abstract, for both degens & normies to pick, play, and profit—all in real-time. Early access from Dec 3. Enter code 'YZ6W2DJ7' to pre-register 👇

Introducing Swipooor by REDACTED A swipe-powered platform, launching on Abstract, for both degens & normies to pick, play, and profit—all in real-time. Early access from Dec 3. Enter code 'YZ6W2DJ7' to pre-register 👇

TenseT.io (prev Redacted)

33,860 次观看 • 1 年前

I just replaced my real estate broker with an AI agent. 🏡 Finds properties 💸 Breaks down mortgage options 📍 Explains neighborhoods Found a cute apartment in SF under $0.01. Built using OpenAI’s Agents SDK + AgentOps 🖇️ for real-time visibility into every step your agents take. 👇 Access in the comments!

I just replaced my real estate broker with an AI agent. 🏡 Finds properties 💸 Breaks down mortgage options 📍 Explains neighborhoods Found a cute apartment in SF under $0.01. Built using OpenAI’s Agents SDK + AgentOps 🖇️ for real-time visibility into every step your agents take. 👇 Access in the comments!

Sri Laasya Nutheti 🖇️

20,851 次观看 • 1 年前

Surreal Cloud beta is live! Build real-time apps faster—start for free. 👉 When you’re ready to take the next step and need something extra, use code WELCOME25 to receive $25.00 in Cloud credits. Don’t delay, expires on January 31st 2025.

Surreal Cloud beta is live! Build real-time apps faster—start for free. 👉 When you’re ready to take the next step and need something extra, use code WELCOME25 to receive $25.00 in Cloud credits. Don’t delay, expires on January 31st 2025.

SurrealDB

14,957 次观看 • 1 年前

Real-time transcription just got a significant upgrade. Universal-3-Pro is now available for streaming — bringing AssemblyAI's most accurate speech model to live audio for the first time. Developers building voice agents, live captioning tools, and real-time analytics pipelines now get three things they've been asking for: 🔹 Best-in-class word error and entity detection across streaming ASR benchmarks 🔹 Real-time speaker labels — know who said what, as it happens 🔹 Superior entity detection for names, places, orgs, and specialized terminology in real-time 🔹 Code-switching and global language coverage built-in

Real-time transcription just got a significant upgrade. Universal-3-Pro is now available for streaming — bringing AssemblyAI's most accurate speech model to live audio for the first time. Developers building voice agents, live captioning tools, and real-time analytics pipelines now get three things they've been asking for: 🔹 Best-in-class word error and entity detection across streaming ASR benchmarks 🔹 Real-time speaker labels — know who said what, as it happens 🔹 Superior entity detection for names, places, orgs, and specialized terminology in real-time 🔹 Code-switching and global language coverage built-in

AssemblyAI

15,016 次观看 • 4 个月前

DiffusionKit now supports Stable Diffusion 3 Medium MLX Python and Core ML Swift Inference work great for on-device inference on Mac! MLX: Core ML: Mac App: Hugging Face Diffusers App (Pending App Store review)

DiffusionKit now supports Stable Diffusion 3 Medium MLX Python and Core ML Swift Inference work great for on-device inference on Mac! MLX: Core ML: Mac App: Hugging Face Diffusers App (Pending App Store review)

argmax

74,009 次观看 • 2 年前

introducing Voice Mode. speak as you draw and get changes in real-time. available now in Krea iPad.

introducing Voice Mode. speak as you draw and get changes in real-time. available now in Krea iPad.

Krea

481,175 次观看 • 4 个月前

Introducing Move Pro 2.0: the next evolution in high-precision motion capture. Local and secure processing, faster performance with dedicated GPUs, cloud scaling options, and second-generation AI models for enhanced accuracy. Perfect for capturing motion from small studios to large stadiums.

Introducing Move Pro 2.0: the next evolution in high-precision motion capture. Local and secure processing, faster performance with dedicated GPUs, cloud scaling options, and second-generation AI models for enhanced accuracy. Perfect for capturing motion from small studios to large stadiums.

Move AI

14,141 次观看 • 1 年前

Introducing WhisperKit

Introducing WhisperKit

argmax

98,397 次观看 • 2 年前

Sarvam Beats GPT-4o: India’s New AI Model Claims Top Spot in Indic Speech Sarvam AI, an Indian startup, recently launched Sarvam Audio, a speech recognition model that claims superior performance over GPT-4o Transcribe on Indic language benchmarks. This development highlights India's push for AI sovereignty in handling local linguistic nuances. Sarvam Audio supports 22 Indian languages from the Eighth Schedule, plus Indian English, with strong handling of code-mixing like Hindi-English blends. It features built-in speaker diarization for up to eight speakers and processes long-form audio such as podcasts or meetings. Trained on the IndicVoices dataset 12,000 hours from over 16,000 speakers across 208 districts it captures real-world noise and spontaneous speech. The model reportedly outperforms GPT-4o Transcribe and Gemini 3 Flash in transcription accuracy (lower Word Error Rate) on IndicVoices benchmarks for unnormalized, normalized, and code-mixed speech. Sarvam attributes this to specialization on Indian accents and patterns, unlike global models trained on Western data. Detailed public benchmarks are pending independent verification. Key Applications 🔴 Call centers and logistics for multilingual transcription. 🔴 Banking, fintech, and e-commerce for customer interactions. 🔴 Podcasts, meetings, and lectures via API for real-time or batch processing. 🔴 This B2B-focused tool aligns with India's IndiaAI Mission, backed by government GPU access for sovereign LLMs. Credit : AIM Networks.

Sarvam Beats GPT-4o: India’s New AI Model Claims Top Spot in Indic Speech Sarvam AI, an Indian startup, recently launched Sarvam Audio, a speech recognition model that claims superior performance over GPT-4o Transcribe on Indic language benchmarks. This development highlights India's push for AI sovereignty in handling local linguistic nuances. Sarvam Audio supports 22 Indian languages from the Eighth Schedule, plus Indian English, with strong handling of code-mixing like Hindi-English blends. It features built-in speaker diarization for up to eight speakers and processes long-form audio such as podcasts or meetings. Trained on the IndicVoices dataset 12,000 hours from over 16,000 speakers across 208 districts it captures real-world noise and spontaneous speech. The model reportedly outperforms GPT-4o Transcribe and Gemini 3 Flash in transcription accuracy (lower Word Error Rate) on IndicVoices benchmarks for unnormalized, normalized, and code-mixed speech. Sarvam attributes this to specialization on Indian accents and patterns, unlike global models trained on Western data. Detailed public benchmarks are pending independent verification. Key Applications 🔴 Call centers and logistics for multilingual transcription. 🔴 Banking, fintech, and e-commerce for customer interactions. 🔴 Podcasts, meetings, and lectures via API for real-time or batch processing. 🔴 This B2B-focused tool aligns with India's IndiaAI Mission, backed by government GPU access for sovereign LLMs. Credit : AIM Networks.

Augadh

43,429 次观看 • 4 个月前

Introducing Primordia, and our revolutionary white-paper. Sound on.🔊 Real utility, real money, in real life. It’s time to change the NFT space forever 🌐 Explore the future of NFTs with Primordia. WP:

Introducing Primordia, and our revolutionary white-paper. Sound on.🔊 Real utility, real money, in real life. It’s time to change the NFT space forever 🌐 Explore the future of NFTs with Primordia. WP:

Moonrunners

88,496 次观看 • 3 年前