🎉 Introducing Sonic: Shifting Focus to Global Audio Perception in Portrait Animation 🎶

👉 What's New?
1️⃣ Breathe life into static images! Single image + any audio → speeches, singing, & beyond!
2️⃣ Temporal Audio Learning harnesses global audio context for precise lip-sync & natural expressions
3️⃣ Decoupled Motion Control...

39,079 views • 1 year ago • via X (Twitter)

11 comments

neb • 1 year ago

I'm confused: is this the same model as 3 months ago, or am I wrong?

AssemblyAI • 1 year ago

Announcing: Our most advanced speech-to-text model goes beyond accuracy to capture the real-world complexity of human conversation and deliver reliable, source-of-truth audio data. Explore Universal-2 updates 👇

MR BIZARRO • 1 year ago

Tencent > openai.

DataEatsWorld • 1 year ago

I don’t even have time to breathe and there’s already some new crazy open-source tech dropped by a big Chinese corp.

James • 1 year ago

Non-commercial license means it's not quite game-changing in those verticals. Impressive nonetheless. Sonic has been around for months now, so what's new? The git repo hasn't been touched in 3 months. Just a hype tweet?

Newman • 1 year ago

Finally! Lip sync that doesn’t look like a dubbed kung fu movie

Chinmaya Kumar Behera • 1 year ago

The Hugging Face demo is not working?

Gadgetify • 1 year ago

This guy is stepping up his game 😅

Maskai • 1 year ago

Great work guys 🙌🏼

Emily • 1 year ago

Looks impressive 👏

Hasan • 1 year ago

Well done
