正在加载视频...

视频加载失败

We’ve released our full paper on on the Stable Audio model 💿 arXiv: Code: Metrics: Demo: 🧵

132,152 次观看 • 2 年前 •via X (Twitter)

9 条评论

Stable Audio 的头像
Stable Audio2 年前

We present the ‘Stable Audio AudioSparx 1.0’ model that can generate long-form, variable-length stereo music and sounds at 44.1kHz. It’s capable of rendering stereo signals of up to 95 sec at 44.1kHz in 8 sec on an A100 GPU 🚀

Stable Audio 的头像
Stable Audio2 年前

Not to brag, but Stable Audio outperforms AudioLDM2 and MusicGen—check out the metrics in the paper. That’s not all, it’s great at generating music. Have a listen below ⬇️

Stable Audio 的头像
Stable Audio2 年前

Stable Audio can generate long-form music with structure (intro, development and outro) from text prompts.

Stable Audio 的头像
Stable Audio2 年前

It can generate stereo sound effects from text prompts.

Stable Audio 的头像
Stable Audio2 年前

It's also very good at generating music loops.

Stable Audio 的头像
Stable Audio2 年前

Great work @zqevans @jordiponsdotme @dadabots @drscotthawley @ODDsWithTheReal

Ivan Rubachev 的头像
Ivan Rubachev2 年前

Cool! Do you plan on releasing weights in the future? Or maybe including this to the

thecollabagepatch 的头像
thecollabagepatch2 年前

i am obsessed with the continuations that #musicgen is capable of generating based upon my input audio. hoping stable audio is working this in as well text prompting isn't very fun for musicians

neil turkewitz 的头像
neil turkewitz2 年前

How is it trained? Do you only use materials with the consent of the original creator? Your parent company @StabilityAI has argued that they can use creative works to train AI models without consent based on “fair use.” Is that how this product was created? @ednewtonrex

相关视频