
Change step_offset scheduler docstrings #7128

Merged
merged 9 commits into huggingface:main on Mar 14, 2024

Conversation

Beinsezii (Contributor)

See #6931 and #6068 for background

Right now it changes all the docstrings from

An offset added to the inference steps. You can use a combination of offset=1 and
set_alpha_to_one=False to make the last step use step 0 for the previous alpha product like in Stable
Diffusion.

to simply

An offset added to the inference steps.

Because that really shouldn't apply to any schedulers except DDIM, and in my opinion even that is arguable: set_alpha_to_one=True is basically final_sigmas_type="zero" for DDIM, so I'm unsure why disabling it was ever recommended.
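To make the comparison concrete, here's a minimal pure-Python sketch of the fallback logic at the final step. The function name and the toy beta schedule are illustrative assumptions, loosely mirroring the idea in diffusers' DDIMScheduler rather than copying its code:

```python
# Sketch of how a DDIM-style scheduler picks the "previous" alpha product
# at the final denoising step. Values are a toy linear-beta schedule,
# not the library's exact numbers.

def final_alpha_prod(alphas_cumprod, set_alpha_to_one):
    # At the last inference step the "previous" timestep is < 0, so the
    # scheduler needs a fallback alpha product:
    #   True  -> pretend the image is fully denoised (alpha_prod = 1.0,
    #            i.e. sigma = 0, analogous to final_sigmas_type="zero")
    #   False -> reuse alphas_cumprod[0], leaving a residual noise floor
    return 1.0 if set_alpha_to_one else alphas_cumprod[0]

# Toy linear-beta schedule with 1000 training timesteps.
betas = [0.0001 + (0.02 - 0.0001) * t / 999 for t in range(1000)]
alphas_cumprod = []
prod = 1.0
for b in betas:
    prod *= 1.0 - b
    alphas_cumprod.append(prod)

print(final_alpha_prod(alphas_cumprod, True))   # 1.0 -> fully denoised
print(final_alpha_prod(alphas_cumprod, False))  # 0.9999 -> tiny noise left
```

With set_alpha_to_one=False the last step targets a nonzero noise level, which is consistent with the residual noise described later in this thread.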

Whether it's from this docstring or otherwise, the official SDXL config incorrectly sets set_alpha_to_one=False despite using Euler, where it doesn't apply, with the result that DDIM performs unusably when it inherits the config. Rest in peace @bghira's early Terminus checkpoints.

"An offset added to the inference steps." isn't very descriptive, but I'm not entirely sure about the motivation behind this value so I'm leaving it as-is for now. Usually if you want to sample later timesteps you'd use a solution like timestep_spacing="trailing" instead.

@Beinsezii Beinsezii marked this pull request as ready for review March 3, 2024 12:27
@sayakpaul sayakpaul requested a review from yiyixuxu March 4, 2024 03:53
yiyixuxu (Collaborator) left a comment

thanks!

@yiyixuxu yiyixuxu requested a review from sayakpaul March 5, 2024 01:21
@yiyixuxu (Collaborator) commented Mar 5, 2024

cc @pcuenca and @patil-suraj here again to see if anyone knows why we recommended using steps_offset=1 and set_alpha_to_one=False

see more #6931

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sayakpaul (Member) left a comment

Looks good to me. Based on the discussion that happened, I think it makes sense.

I would be curious to know the reasons behind the existing recommendation around steps_offset as well!

@Beinsezii (Contributor, Author)

I almost wonder if it'd be worth having an anti-recommendation?

Like for final_sigmas_type and set_alpha_to_one have a lil blurb that's something like

Only disable [this] if you're certain it's required.

Even if SAI never updates their goofy SDXL config, it might help people looking through the docs to figure out why their outputs are bad?

Between plain Epsilon, Turbo, V-Pred, and ZSNR I haven't observed a single model that would require clipping the final step instead of going straight for zero. Maybe that only applies to certain timestep spacings..?
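The "clipping the final step vs. going straight for zero" distinction can be sketched for sigma-based schedulers like Euler. This is a toy illustration of what final_sigmas_type controls; the option names follow diffusers but the sigma values here are made up:

```python
# Sketch of final_sigmas_type for sigma-based schedulers such as Euler.
# Toy values; not the library's actual noise schedule.

def append_final_sigma(sigmas, final_sigmas_type):
    if final_sigmas_type == "zero":
        # Last step denoises all the way: no noise left in the output.
        return sigmas + [0.0]
    elif final_sigmas_type == "sigma_min":
        # Last step is clipped at the smallest training sigma: a small
        # noise floor remains, similar to the set_alpha_to_one=False look.
        return sigmas + [sigmas[-1]]
    raise ValueError(final_sigmas_type)

sigmas = [14.6, 7.0, 3.1, 1.2, 0.03]  # toy descending noise schedule
print(append_final_sigma(sigmas, "zero")[-1])       # 0.0
print(append_final_sigma(sigmas, "sigma_min")[-1])  # 0.03
```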

@pcuenca (Member)

pcuenca commented Mar 5, 2024

I've been checking the history a bit and it looks like steps_offset was formally introduced in the configuration in #479, motivated by #465. In particular, the different scheduling configurations (at the time) were abstracted to those two properties as summarized by Patrick in this comment: #465 (comment). So the use of set_alpha_to_one=False and steps_offset=1 was required for Stable Diffusion.

In #3947, I think we just went through other schedulers and replicated the same logic. IIRC we checked results against the SDXL implementation, but I'm not completely sure if we used DDIM or Euler as a baseline (I think it was DDIM and then changed to Euler).

@Beinsezii (Contributor, Author)

Beinsezii commented Mar 5, 2024

So the use of set_alpha_to_one=False and steps_offset=1 was required for Stable Diffusion.

"Was", as in it's no longer required at all? Because I can't find a purpose for either property.

I ran every combination of steps_offset, set_alpha_to_one, and timestep_spacing on both SDXL and SD1.5. All alpha=False seems to do is just make the image noisier which is what you'd expect since the final sigma isn't zero.

Demo at the highest JPG quality GitHub will allow. You can particularly see the unsampled noise left over in the flat blue sky in any image where set_alpha_to_one=False.

[image: comparison grid of outputs for every configuration combination]
For reproduction:

  • colorful vector art of a fennec fox on mars
  • blurry, noisy, cropped
  • seed 1
  • steps 30
  • guidance 5
  • rescale 0
  • noise added using f32 on CPU
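The sweep described above can be sketched as a configuration grid. The combination logic below is a plain-Python illustration; the parameter values are the ones named in the comment, and whether all of them are valid for every scheduler class is an assumption:

```python
# Enumerate every combination of the three scheduler settings tested
# above. Illustrative sketch of the sweep, not a verified run.
from itertools import product

steps_offsets = [0, 1]
set_alpha_flags = [True, False]
spacings = ["leading", "trailing", "linspace"]

configs = [
    {"steps_offset": o, "set_alpha_to_one": a, "timestep_spacing": s}
    for o, a, s in product(steps_offsets, set_alpha_flags, spacings)
]
print(len(configs))  # 12 scheduler configurations per model
```

In diffusers, each config dict could then be applied with something like `DDIMScheduler.from_config(pipe.scheduler.config, **cfg)` before generating with a fixed seed, so the only variable between images is the scheduler configuration.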

@sayakpaul (Member)

@yiyixuxu WDYT of merging this and letting the discussion continue?

@pcuenca (Member)

pcuenca commented Mar 6, 2024

@Beinsezii Thanks a lot for the tests! I don't have an answer unfortunately, maybe a bug was introduced after those PRs, or maybe the problem was always there from the original stable diffusion codebase (I'm pretty sure we compared model outputs when the conversion was made), or maybe the logic in that PR was flawed after all. I did browse the current DDIM code vs the version in the PR and nothing stood out as being obviously wrong. Perhaps we could run generations immediately before and after the change in the PR and see if anything changes. I'd love to help with that, but I'm very time-constrained this week :(

Regarding the doc changes in this PR, I'm supportive if you think they'll help reduce the confusion. It may still be a bit puzzling for new users; perhaps we can improve it a bit without mentioning set_alpha_to_one.

Before:

    An offset added to the inference steps. You can use a combination of `offset=1` and
    `set_alpha_to_one=False` to make the last step use step 0 for the previous alpha product like in Stable
    Diffusion.

After:

    An offset added to the inference steps.
A reviewer (Member) suggested a change:

    - An offset added to the inference steps.
    + An offset added to the inference steps, as required by some model families.

How about something like this? Would it still be confusing?

Beinsezii (Contributor, Author) replied:

I think it's fair. Curious what @yiyixuxu or @sayakpaul think.

yiyixuxu (Collaborator) replied:

ok let's do this?

Beinsezii (Contributor, Author) replied:

Updated.

@pcuenca (Member)

pcuenca commented Mar 6, 2024

Another idea would be to go back to checking against the reference CompVis codebase, comparing outputs. Our guiding light for integration was to generate 1-to-1 identical results (on CPU using float32), and I think that's the goal we should still have as a library. The test suite should ensure that continues to be the case, though it might be imperfect.

@Beinsezii (Contributor, Author)

The CompVis repo has a bunch of pinned dependencies that don't work on my GPU so I can't check that myself.

Maybe something like ComfyUI could also be a decent benchmark? I believe SAI said they were using it for their Discord image gen bot during the SDXL trial period.

Beinsezii and others added 3 commits March 9, 2024 14:28
These ones failed a literal search-and-replace because I performed it case-sensitively, which is fun.
@sayakpaul (Member)

@yiyixuxu do we wanna merge it?

@bghira (Contributor)

bghira commented Mar 13, 2024

@patrickvonplaten maybe we want to update the SAI configs on the Hub once this is merged? It's not blocked by this PR, since this is just a documentation fix, but training/inference code still inherits the bad config.

@yiyixuxu yiyixuxu merged commit d3986f1 into huggingface:main Mar 14, 2024
15 checks passed
@Beinsezii Beinsezii deleted the scheduler_docs branch April 3, 2024 20:59