Stable Diffusion Samplers: An Updated Study

@Nafnlaus · edit-2 1 year ago

Stable Diffusion Samplers: An Updated Study

@[email protected] · 1 year ago

Hey thanks for this! I’ve been looking for some comparisons with SDXL, this is perfect. I’ve been using mostly DPM++ SDE Karras with low steps and finding it always the best, but it’s interesting that you found that at high steps it’s still high, but not a clear winner. Thanks for sharing this!

P03 Locke · 1 year ago

All of the “a” samplers are ancestor samplers and will not converge on a final image. They will just keep on trying to “fix” the image with different variations. There’s kind of no point in testing them, especially since they almost always have non-ancestral counterparts that work more deterministically.

Also, why is DPM++ 3M not in the conclusions? It seems off to me that the latest version of DPM++ is somehow worse than DPM++ 2M. Also, Euler and DDIM, some of the oldest samplers ever, are rated quite high here, and that already has me questioning the results.

@Nafnlaus · edit-2 1 year ago

This was not a convergence test; it was an “accuracy + aesthetics” test.

There very much is a point to testing the accuracy and aesthetics of samplers, including ancestral ones. Indeed, that’s the entire point. By contrast, the whole point of doing a large number of tests is to compensate for the fact that you’re not going to get the same result with every sampler at every number of steps, and thus a large number of tests offsets the luck of the draw.

I have no sampler named just “DPM++ 3M”. I have three DPM++ 3M samplers: SDE, SDE Karras, and SDE Exponential - the yellow samplers in the graphs.

Anything can “feel off” to you, but this is what the data shows. I had some of my own biases coming into this that got busted in the process (while I wasn’t surprised by DPM++ SDE’s performance, I expected the new samplers to be standouts and old samplers like Euler a to be poor). If you feel the sample size is too small or is somehow biased, by all means contribute - I literally included the spreadsheet link so others could take part! :) Just a caveat: do your best to not mentally keep track of what sampler’s images you’re rating; we want it to be as “blind” of a test as can be reasonably done.