Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

We just solved text-to-speech AI. This model can simulate perfect emotion, screaming and show genuine alarm. — clearly beats 11 labs and Sesame — it’s only 1.6B params — streams realtime on 1 GPU — made by a 1.5 person team in Korea!! It's called Dia by Nari Labs.

710,230 Aufrufe • vor 1 Jahr •via X (Twitter)

11 Kommentare

Profilbild von Deedy
Deedyvor 1 Jahr

Source:

Profilbild von Deedy
Deedyvor 1 Jahr

The future is about to look really weird. Audio may have just crossed the uncanny valley (like parts of text and Ike have) into most-humans-wont-know-this-is-AI territory

Profilbild von MightyBot
MightyBotvor 1 Jahr

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

Profilbild von Yuchen Jin
Yuchen Jinvor 1 Jahr

what is the 0.5 person in the 1.5 person team? 😂

Profilbild von Deedy
Deedyvor 1 Jahr

Part time research engineer!

Profilbild von Mudit Juneja
Mudit Junejavor 1 Jahr

Who are we here? Are you tied to this project?

Profilbild von Deedy
Deedyvor 1 Jahr

We = humanity

Profilbild von Cr33d
Cr33dvor 1 Jahr

1.5 people?! Did the 0.5 person just handle the screaming?

Profilbild von Rithik Chopra
Rithik Chopravor 1 Jahr

Damn that’s crazy!!!

Profilbild von Albert Sebastian
Albert Sebastianvor 1 Jahr

whats your take on hume ai?

Profilbild von Cr33d
Cr33dvor 1 Jahr

Perfect emotion? Finally, my toaster can apologize for burning my toast! 😂

Ähnliche Videos