Meet EVI 3, another step toward general voice intelligence. EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.

Hume AI

23,199 subscribers

832,583 views • 1 year ago • via X (Twitter)

Science & Technology

12 Comments

Hume • 1 year ago

Try it at 🤖

Hume • 1 year ago

EVI 3 can transform its delivery implicitly or on command—it stammers anxiously, debates enthusiastically, and whispers intimately. The model can also generate any voice and personality from your prompt in under a second. Speak to a “raspy Australian history buff,” “sassy British prankster,” or “excited Caribbean musician.”

Hume • 1 year ago

EVI 3 sits at the frontier of low-latency intelligence, thanks to our latest voice-to-voice architecture. It has the same intelligence as a frontier LLM of similar size but adds voice-to-voice capability with minimal latency overhead, making it faster and more efficient than a cascaded system with separate LLM and voice models.

Hume • 1 year ago

We developed new architectures for our speech-LLM, encoder, and decoder. Instead of fine-tuning for individual speakers, we used reinforcement learning to identify and hone the preferred qualities of all human voices. We made major upgrades to our platform for large-scale psychological data collection. And we developed a new streaming approach to achieve conversational latency. Our research paid off. In blind tests, users consistently preferred EVI 3 for: 🧠 Emotion understanding 💬 Natural conversational flow 🎭 Voice quality and expressiveness ⚡ Response speed

Hume • 1 year ago

Finally, EVI 3 communicates with larger models as it’s speaking to you, allowing it to respond with the most up-to-date knowledge and reasoning. Read more about our research at

SecBriefs | Making Cybersecurity Simple • 1 year ago

🤝 Interviewing with HR or non-technical managers? 🌟 Employers value clarity and confidence. Cybersecurity Dictionary for Everyone teaches you to explain cybersecurity concepts in a way everyone can understand. Build connections that matter! 🌟

Linus Ekenstam • 1 year ago

this is good, it’s now becoming more and more clear that we are on a path towards complete convergence for AGI, we need a lot of pieces in that puzzle and this is certainly en route to be one of them.

Hume • 1 year ago

voice AI with emotional intelligence will be so important

Haider. • 1 year ago

this is excellent 👍🏻

Hume • 1 year ago

thank you!

𝐌𝐲𝐬𝐭𝐫𝐚 ✨ • 1 year ago

Vibes 🍓

Hume • 1 year ago

👀

Related Videos

Introducing Empathic Voice Interface 2 (EVI 2), our new voice-to-voice foundation model. EVI 2 merges language and voice into a single model trained specifically for emotional intelligence. You can try it and start building today.

Hume AI

165,616 views • 1 year ago

This is the most realistic voice cloning I’ve seen. It doesn’t just copy the voice — it captures tone, rhythm, and emotion. Built with Hume’s EVI 3 speech-to-speech model and it even works with external LLMs like Groq, Anthropic, and DeepSeek. Just listen to this Ricky Gervais voice clone — same voice, same feeling 🎧 Try talking to the clone yourself →

Arsalan

26,923 views • 11 months ago

Meet Hume’s Empathic Voice Interface (EVI), the first conversational AI with emotional intelligence.

Hume AI

875,306 views • 2 years ago

EVI, the frontier voice AI with emotional intelligence, is now a lot smarter—and available as an iOS app! 📲 Featuring a bold new and improved AI voice named Kora 💁‍♀️and integrating Claude 3.5 Sonnet into its responses, EVI is ready to listen, answer, and explore →

Hume AI

27,546 views • 2 years ago

Voice Design v3 is here. Create any voice you can imagine with a prompt. We’ve rebuilt the underlying Voice Design model to deliver higher quality and broader expressive range. Generate production-ready voices in 70+ languages with support for hundreds of localized accents.

ElevenLabs

153,848 views • 1 year ago

Introducing OpenVoice V2 - a Text-to-Speech model that can clone any voice and speak in any language. Developed by MyShell and MIT CSAIL researchers. 🌐 Imagine your voice going global in multiple languages. 🔊 OpenVoice V2 breaks the language barrier and redefines voice interaction.

MyShell.AI

292,391 views • 2 years ago

3. Speech to Speech Record your own voice or upload a voice file and this tool will mirror voice with same accent and emotions.

Sehaj Singh

23,353 views • 10 months ago

MIT CSAIL and MyShell.AI researchers introduce OpenVoice V2, a text-to-speech model that can clone any voice and speak in many languages. Imagine your voice going global in multiple languages. OpenVoice V2 breaks the language barrier and redefines voice interactions.

MIT CSAIL

66,460 views • 2 years ago

I taught a speech model to understand context in conversation. This is what happened It adjusts voice and tone to express urgency, comfort, understanding from the dialogue. Just like a real human being 520M model. Runs locally on consumer devices How this is achieved 🧵

Luozhu

21,332 views • 4 months ago

This is live. No auto tune, just pure talent and a voice of liquid gold.

Emotion & Music

16,493 views • 25 days ago

HOLY CRAP, a new super tiny 1.6B param voice model just dropped that seems to.. outperform 11labs!? 😵‍💫 From Nari-labs, Dia is an Apache 2.0 voice model, that can generate laughs, sniffs and emotions, copy an existing voice and is effectively real time on larger GPUs:

Alex Volkov

525,803 views • 1 year ago

Today, we’re proud to unveil a pioneering partnership with @Groq. Our Dialog voice model now runs on GroqCloud. This isn't just an upgrade; this is a leap forward that redefines what voice technology can be—fast, powerful, and beautifully human.

PlayAI

155,989 views • 1 year ago

In the gospel workshop I hosted about hearing God’s voice in an age of artificial intelligence, I invited viewers to take specific actions to deepen their relationships with God, self, others, and the natural world and environment around us. In a world of accelerating technology and artificial intelligence, may we never lose the divine intelligence that matters most—the voice of God. Learn more about hearing God’s voice in an age of artificial intelligence:

Gerrit W. Gong

13,825 views • 19 days ago

Introducing Typeless writing assistance. Today begins a world where your voice can do anything to any text. It’s another step toward our vision: Your voice as superpowers. Say it, and it happens.

Huang Song

464,515 views • 9 months ago

So you can clone any voice 100% locally using this new open source model?! Alibaba has released Qwen3-TTS on Hugging Face. You can easily: - Create custom voices - Clone any voice from a VERY short audio - Generate speech with style instructions Only 0.6B & 1.8B! Sound on🔊

Paul Couvert

148,975 views • 5 months ago

omg.. this is crazy HeyGen just dropped Voice Mirroring, it can clone anyone's voice emotion, tone and style PERFECTLY no words, just watch step by step tutorial:

el.cine

224,397 views • 1 year ago

You can use Cerebras' gpt-oss-120b to build realistic speech-to-speech voice interfaces with emotion via Hume AI's new EVI 3. Perfect for your next voice ai project! Link to try below 👇

Cerebras

22,034 views • 10 months ago

Apparently there is a new Voice-Filter feature in TikTok which tries very hard to generate a human voice, so people run it with all sorts of non-human sources and it's kind of hilarious. Here is a link to the Hashtag:

Doron Adler

367,008 views • 2 years ago

This is not being talked about enough Zonos is a new open-source voice AI model that clones any voice in under 10 seconds. Here is how I made a voice clone of Matt Wolfe !

Miguel | AP

85,318 views • 1 year ago