
Hume AI
@hume_ai • 23,257 subscribers
The empathic AI research lab. Providing the open source models, datasets, and evaluation APIs to embed emotional intelligence into your models.

Introducing Octave 2: our next-generation multilingual text-to-speech model
What’s new:
- Fluent in 11+ languages
- 40% faster (<200ms latency) & 50% cheaper than Octave 1
- Multi-speaker conversation
- More reliable pronunciation
- New voice conversion & phoneme editing capabilities
For the month of October, we’re offering 50% off our Creator plan. Use code OCTAVE2 at checkout!
Hume AI • 7,077,015 views • 8 months ago

Today we're releasing our first open source TTS model, TADA!
TADA (Text Audio Dual Alignment) is a speech-language model that generates text and audio in one synchronized stream to reduce token-level hallucinations and improve latency.
This means:
→ Zero content hallucinations across 1,000+ test samples
→ 5x faster than similar-grade LLM-based TTS
→ Fits much longer audio: 2,048 tokens cover ~700 seconds with TADA vs. ~70 seconds in conventional systems
→ Free transcript alongside audio with no added latency
Hume AI • 268,524 views • 3 months ago

Meet EVI 3, another step toward general voice intelligence. EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.
Hume AI • 832,363 views • 1 year ago

Meet Hume’s Empathic Voice Interface (EVI), the first conversational AI with emotional intelligence.
Hume AI • 875,172 views • 2 years ago

Today, we’re releasing Octave: the first LLM built for text-to-speech.
🎨 Design any voice with a prompt
🎬 Give acting instructions to control emotion and delivery (sarcasm, whispering, etc.)
🛠️ Produce long-form content on our Creator Studio
Unlike traditional TTS that just “reads” words aloud, Octave understands how meaning affects delivery to generate emotional, human-like speech.
Hume AI • 393,658 views • 1 year ago

You can now control a computer with just your voice. Here’s how we did it: 🧵
Hume AI • 428,385 views • 1 year ago

Introducing Voice Control by Hume
We developed an experimental voice modulation approach that enables you to create unique AI voices in seconds. Our voice sliders make it intuitive to adjust base voices along 10 interpretable dimensions, including:
👃 Nasality: resonant to nasal
🎼 Masculine/Feminine: from masculine to feminine
🎈 Buoyancy: from deflated to buoyant
Check out the sample creations in the thread below 👀
Hume AI • 200,886 views • 1 year ago

OpenAI enters the Expressive TTS Arena 🥊🤖🥊 Now hosted on Hugging Face, this arena is a new way to evaluate voice AI systems with natural language instructions + richer text. Compare Hume's TTS against ElevenLabs and OpenAI and see if you agree with the leaderboard results!
Hume AI • 75,343 views • 1 year ago

Introducing Expressive TTS Arena 🥊🤖🥊 ⚡️ 🥊🤖🥊
Starting with Hume AI vs ElevenLabs, it's a new way to evaluate voice AI systems with natural language instructions + richer text.
As voice generation systems evolve, we wanted to show an example of an eval system better suited to cutting-edge models 👇
Hume AI • 52,617 views • 1 year ago

The EVI API is finally here!
With our demo alone, we were surprised so many people felt a connection to the world’s first emotionally intelligent voice AI:
✨ ~100K conversations
⏱️ 10 min average conversation length
💬 3M user messages
Start building applications your users will love even more! We can’t wait to see what you create.
Hume AI • 57,877 views • 2 years ago