正在加载视频...

视频加载失败

new extremely fast text-to-audio model

68,825 次观看 • 1 年前 •via X (Twitter)

10 条评论

Dreaming Tulpa 🥓👑 的头像
Dreaming Tulpa 🥓👑1 年前

this is TangoFlux, a new text-to-audio model that can generate 30 seconds of 44.1kHz audio in just 3.7 seconds on a single A40 GPU project page: code: demo:

Bytescribe 的头像
Bytescribe1 年前

Introducing Vehrbal, the AI that converts audio into SOAP notes! Say goodbye to wasted time and hello to effortless note-taking. Experience the power of fast, simple, and efficient with Vehrbal today.

Nim Eshed 𝕏🦋 的头像
Nim Eshed 𝕏🦋1 年前

Do you know if there is a good to to duplicate voice

Dreaming Tulpa 🥓👑 的头像
Dreaming Tulpa 🥓👑1 年前

not sure how it has evolved, but check out tortoise and t5-tts

michielh.eth 的头像
michielh.eth1 年前

I'm checking it now, need more ebooks in audio format.

Dreaming Tulpa 🥓👑 的头像
Dreaming Tulpa 🥓👑1 年前

don’t think this is gonna hold up for it but report back!

Russ Shimon 的头像
Russ Shimon1 年前

Cool. Going to check it out.

Dreaming Tulpa 🥓👑 的头像
Dreaming Tulpa 🥓👑1 年前

enjoy

Allar Haltsonen 的头像
Allar Haltsonen1 年前

☄️☄️

aivrar 的头像
aivrar1 年前

Doesn't seem to make great music, still promising though for general sound effects.

相关视频