Video yükleniyor...
Video Yüklenemedi
We just solved text-to-speech AI. This model can simulate perfect emotion, screaming and show genuine alarm. — clearly beats 11 labs and Sesame — it’s only 1.6B params — streams realtime on 1 GPU — made by a 1.5 person team in Korea!! It's called Dia by Nari Labs.
709,864 görüntüleme • 1 yıl önce •via X (Twitter)
11 Yorum

Source:

The future is about to look really weird. Audio may have just crossed the uncanny valley (like parts of text and Ike have) into most-humans-wont-know-this-is-AI territory

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

what is the 0.5 person in the 1.5 person team? 😂

Part time research engineer!

Who are we here? Are you tied to this project?

We = humanity

1.5 people?! Did the 0.5 person just handle the screaming?

Damn that’s crazy!!!

whats your take on hume ai?

Perfect emotion? Finally, my toaster can apologize for burning my toast! 😂

