Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Do diffusion models want Perlin-like noise? SDXL-turbo spends the first blocks simply creating Perlin-like noise of different frequencies, amplitudes and orientations. The model ignores the prompt conditioning during these layers.

59,764 Aufrufe • vor 1 Jahr •via X (Twitter)

12 Kommentare

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

You can ablate the conditioning prompt info during those layers. The model doesn't care. It's busy building a noise latticework. This infrastructure it's creating here is completely independent of the prompt.

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

Ken Perlin made his eponymous noise explicitly so that it would look natural when used for procedural generation of textures and terrains—it's cool that models have learned something similar! Here's link to SDXL-turbo. Will open to a layer showing the Perlin-like noise:

Profilbild von Rainmaker
Rainmakervor 2 Jahren

Can Machine Learning beat the market? Check out this post on my free Substack where I share code and commentary for an XGBoost model and a Random Forest model that both deliver powerful performances.

Profilbild von Ethan is in SF
Ethan is in SFvor 1 Jahr

Have a section here back in the days of guided diffusion where you could init with perlin noise instead of Gaussian and skip the first whole 10-20% of steps!

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

oh nice! that was exactly my thinking, you're way ahead!

Profilbild von Diego Porres
Diego Porresvor 1 Jahr

StableDiffusion yearns for Perlin

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

excellent! this was exactly the hope of reply I was hoping to get.

Profilbild von Peter Baylies
Peter Bayliesvor 1 Jahr

👍 I wasn't the first, and I'm sure I won't be the last; but yeah, I think a little perlin noise can often help.

Profilbild von FeepingCreature
FeepingCreaturevor 1 Jahr

If you inject randomized perlin noise at these layers, does the output break? That is, does it want that specific noise or just any noise? Or is it doing octave shifting on the input noisemap?

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

Great question. The noise maintains scale, orientation and stretch across seeds. Watch what happens when cycle through seeds below. Also note that when change the prompt nothing happens.

Profilbild von Sasha
Sashavor 1 Jahr

it seems to be similar to DiT study shows that conditioning mostly affects on mid blocks

Profilbild von Rudy Gilman
Rudy Gilmanvor 1 Jahr

Nice. Yes I would imagine that's what the research should show. I think often the models just aren't ready yet to bring in the semantic payload.

Ähnliche Videos