正在加载视频...
视频加载失败
We’ve released our full paper on on the Stable Audio model 💿 arXiv: Code: Metrics: Demo: 🧵
9 条评论

We present the ‘Stable Audio AudioSparx 1.0’ model that can generate long-form, variable-length stereo music and sounds at 44.1kHz. It’s capable of rendering stereo signals of up to 95 sec at 44.1kHz in 8 sec on an A100 GPU 🚀

Not to brag, but Stable Audio outperforms AudioLDM2 and MusicGen—check out the metrics in the paper. That’s not all, it’s great at generating music. Have a listen below ⬇️

Stable Audio can generate long-form music with structure (intro, development and outro) from text prompts.

It can generate stereo sound effects from text prompts.

It's also very good at generating music loops.

Great work @zqevans @jordiponsdotme @dadabots @drscotthawley @ODDsWithTheReal

Cool! Do you plan on releasing weights in the future? Or maybe including this to the

i am obsessed with the continuations that #musicgen is capable of generating based upon my input audio. hoping stable audio is working this in as well text prompting isn't very fun for musicians

How is it trained? Do you only use materials with the consent of the original creator? Your parent company @StabilityAI has argued that they can use creative works to train AI models without consent based on “fair use.” Is that how this product was created? @ednewtonrex
