We just solved text-to-speech AI. This model can simulate perfect emotion, screaming, and genuine alarm.
- clearly beats ElevenLabs and Sesame
- it's only 1.6B params
- streams in real time on 1 GPU
- made by a 1.5-person team in Korea!!
It's called Dia, by Nari Labs.

710,352 views • 1 year ago • via X (Twitter)

Comments: 11

Deedy • 1 year ago

Source:

Deedy • 1 year ago

The future is about to look really weird. Audio may have just crossed the uncanny valley (like parts of text and image have) into most-humans-won't-know-this-is-AI territory.

MightyBot • 1 year ago

🧠 Unified Search. Smarter Meetings. Effortless CRM. MightyBot is your AI agent platform for seamless workflows—record meetings, automate CRM updates, and find answers across apps in seconds. 🌟 Focus on what matters. We'll handle the grind.

Yuchen Jin • 1 year ago

what is the 0.5 person in the 1.5 person team? 😂

Deedy • 1 year ago

Part time research engineer!

Mudit Juneja • 1 year ago

Who is "we" here? Are you tied to this project?

Deedy • 1 year ago

We = humanity

Cr33d • 1 year ago

1.5 people?! Did the 0.5 person just handle the screaming?

Rithik Chopra • 1 year ago

Damn that’s crazy!!!

Albert Sebastian • 1 year ago

What's your take on Hume AI?

Cr33d • 1 year ago

Perfect emotion? Finally, my toaster can apologize for burning my toast! 😂
