Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

We just solved text-to-speech AI. This model can simulate perfect emotion, screaming and show genuine alarm. — clearly beats 11 labs and Sesame — it’s only 1.6B params — streams realtime on 1 GPU — made by a 1.5 person team in Korea!! It's called Dia by Nari Labs.

709,864 görüntüleme • 1 yıl önce •via X (Twitter)

11 Yorum

Deedy profil fotoğrafı
Deedy1 yıl önce

Source:

Deedy profil fotoğrafı
Deedy1 yıl önce

The future is about to look really weird. Audio may have just crossed the uncanny valley (like parts of text and Ike have) into most-humans-wont-know-this-is-AI territory

MightyBot profil fotoğrafı
MightyBot1 yıl önce

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

Yuchen Jin profil fotoğrafı
Yuchen Jin1 yıl önce

what is the 0.5 person in the 1.5 person team? 😂

Deedy profil fotoğrafı
Deedy1 yıl önce

Part time research engineer!

Mudit Juneja profil fotoğrafı
Mudit Juneja1 yıl önce

Who are we here? Are you tied to this project?

Deedy profil fotoğrafı
Deedy1 yıl önce

We = humanity

Cr33d profil fotoğrafı
Cr33d1 yıl önce

1.5 people?! Did the 0.5 person just handle the screaming?

Rithik Chopra profil fotoğrafı
Rithik Chopra1 yıl önce

Damn that’s crazy!!!

Albert Sebastian profil fotoğrafı
Albert Sebastian1 yıl önce

whats your take on hume ai?

Cr33d profil fotoğrafı
Cr33d1 yıl önce

Perfect emotion? Finally, my toaster can apologize for burning my toast! 😂

Benzer Videolar