
Hume AI
@hume_ai • 23,257 subscribers
The empathic AI research lab. Providing the open source models, datasets, and evaluation APIs to embed emotional intelligence into your models.
Shorts
Videos

Introducing Octave 2: our next-generation multilingual text-to-speech model What’s new: - Fluent in 11+ languages - 40% faster (<200ms latency) & 50% cheaper than Octave 1 - Multi-speaker conversation - More reliable pronunciation - New voice conversion & phoneme editing capabilities For the month of October, we’re offering 50% off our Creator plan - use code OCTAVE2 at checkout!
Hume AI7,077,015 次观看 • 8 个月前

Today we're releasing our first open source TTS model, TADA! TADA (Text Audio Dual Alignment) is a speech-language model that generates text and audio in one synchronized stream to reduce token-level hallucinations and improve latency. This means: → Zero content hallucinations across 1,000+ test samples → 5x faster than similar-grade LLM-based TTS → Fits much longer audio: 2,048 tokens cover ~700 seconds with TADA vs. ~70 seconds in conventional systems → Free transcript alongside audio with no added latency
Hume AI268,524 次观看 • 3 个月前

Meet EVI 3, another step toward general voice intelligence. EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.
Hume AI832,363 次观看 • 1 年前

Today, we’re releasing Octave: the first LLM built for text-to-speech. 🎨Design any voice with a prompt 🎬 Give acting instructions to control emotion and delivery (sarcasm, whispering, etc.) 🛠️Produce long-form content on our Creator Studio Unlike traditional TTS that just “reads” words aloud, Octave understands how meaning affects delivery to generate emotional, human-like speech.
Hume AI393,658 次观看 • 1 年前

Introducing Voice Control by Hume We developed an experimental voice modulation approach that enables you to create unique AI voices in seconds. Our voice sliders make it intuitive to adjust base voices along 10 interpretable dimensions including: 👃 Nasality: resonant to nasal 🎼 Masculine/Feminine: from masculine to feminine 🎈 Buoyancy: from deflated to buoyant Check out the sample creations in the thread below 👀
Hume AI200,886 次观看 • 1 年前

Introducing Expressive TTS Arena 🥊🤖🥊 ⚡️ 🥊🤖🥊 Starting with Hume AI vs ElevenLabs, it's a new way to evaluate voice AI systems with natural language instructions + richer text As voice generation systems evolve, we wanted to show an example of an eval system better suited toward cutting edge models👇
Hume AI52,617 次观看 • 1 年前

The EVI API is finally here! With our demo alone, we were surprised so many people felt a connection to the world’s first emotionally intelligent voice AI: ✨ ~100K conversations ⏱️ 10 min average conversation length 💬 3M user messages Start building applications your users will love even more! We can’t wait to see what you create.
Hume AI57,877 次观看 • 2 年前
没有更多内容可加载
