Video wird geladen...
Video konnte nicht geladen werden
new extremely fast text-to-audio model
68,825 Aufrufe • vor 1 Jahr •via X (Twitter)
10 Kommentare

this is TangoFlux, a new text-to-audio model that can generate 30 seconds of 44.1kHz audio in just 3.7 seconds on a single A40 GPU project page: code: demo:

Introducing Vehrbal, the AI that converts audio into SOAP notes! Say goodbye to wasted time and hello to effortless note-taking. Experience the power of fast, simple, and efficient with Vehrbal today.

Do you know if there is a good to to duplicate voice

not sure how it has evolved, but check out tortoise and t5-tts

I'm checking it now, need more ebooks in audio format.

don’t think this is gonna hold up for it but report back!

Cool. Going to check it out.

enjoy

☄️☄️

Doesn't seem to make great music, still promising though for general sound effects.

