Add imagic to community pipelines #958

MarkRich · 2022-10-24T02:46:38Z

Part of #895 and bigger story #841

Followed the rough code / parameters given here: https://github.com/justinpinkney/stable-diffusion/blob/main/notebooks/imagic.ipynb.

A few notes for reviews:

Seems like a bad idea to modify the stable diffusion pipeline to take text embeddings; that said it wasn't clear how we plan to re-use this pipeline given these custom generated embeddings. Any advice on how to pass these in? Or is modification given reasonable?
Also seems like a bad idea to define the learning rates where I do. That said this is a bit of an odd stable diffusion pipeline since we're training in the __call__, so not sure where these are expected to go.
Similar comment as in Add Composable diffusion to community pipeline examples #951: Test condition given in readme doesn't work locally and need to make a few changes to make it work locally, but I expect this is only an issue w/ local testing and/or I am missing something w.r.t. how these scripts are intended to be tested.

Results:

Requires 24gb of vram and takes about 7-10 minutes on a 3090, though apparently it's 30g vram in 5pm on an a100 in original script. So reasonable performance?

Initial Image:

Prompt: "A photo of Barack Obama smiling with a big grin"

Image from just text embedding:

Image after text embeddings have been optimized:

Final image at alpha = 0.8

Final Image at alpha = 1.5

Final Image at alpha = 2.

Looking forward to any comments!

HuggingFaceDocBuilderDev · 2022-10-24T02:50:18Z

The documentation is not available anymore as the PR was closed or merged.

patrickvonplaten

Great job @MarkRich!

The design is generally fine with me! Also related to #955 - seems like there are multiple use cases for custom text_embeddings already

@patil-suraj could you do a more in-depth review?

patil-suraj

Very cool @MarkRich , thanks a lot for adding the feature

Th pr looking really good! Just left a few nits.

And I'm not sure yet, if we should modify the StableDiffusionPipeline to allow text_embeddinsg, we are discussing it here #955

For now, since we are adding a custom pipeline, I would suggest we could add to functions to the pipeline.

pipeline.train to train the embeddings and mode
pipeline.__call__ or pipeline.generate to generate the images.

wdyt @patrickvonplaten

examples/community/README.md

examples/community/imagic_stable_diffusion.py

patil-suraj · 2022-10-26T13:58:09Z

examples/community/imagic_stable_diffusion.py

+        optimizer = torch.optim.Adam(
+            [text_embeddings],  # only optimize the embeddings
+            lr=embedding_learning_rate,
+        )


Maybe also allow the option to use 8 but optimizer

examples/community/imagic_stable_diffusion.py

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

MarkRich · 2022-10-27T17:42:59Z

Addressed all your comments @patil-suraj aside from 8-bit optimization which may take slightly longer for to instrument due to an unrelated error. Let me know if you have any other comments!

patil-suraj

Thanks a lot for addressing the comments @MarkRich !Looks good, will give it try now and then merge soon :)

patil-suraj

The example works great, I just two more comments, then it should be good to merge :)

examples/community/imagic_stable_diffusion.py

examples/community/README.md

patrickvonplaten

Looks good! Thanks a lot :-)
@patil-suraj feel free to merge whenever

…on for imagic pipeline

MarkRich · 2022-11-01T04:28:24Z

Updated with comments from the review; should be good to go!

patil-suraj · 2022-11-01T10:17:46Z

Thanks a lot @MarkRich ! The tests failures are unrelated, merging!

zhongyi-zhou · 2023-03-14T08:04:11Z

@MarkRich Thanks for the amazing code!
I have one question on this line:

diffusers/examples/community/imagic_stable_diffusion.py

Line 312 in f9cfb5a

noise_pred = self.unet(noisy_latents, timesteps, text_embeddings).sample

Why it is text_embeddings instead of text_embeddings_orig?

According to the Imagic paper, in the "model fine-tuning" stage, it says

"We fine-tune them with the same reconstruction loss, but conditioned on $e_{tgt}$, as $e_{opt}$ is optimized for the base model only" (Page 5; the second column)

From my understanding $e_{tgt}$ is text_embeddings_orig and $e_{opt}$ is text_embeddings in this code.
In practice, these two values should be very similar, and probably that's the reason why the code still work amazingly well.

Please correct me if I am wrong. Thanks!

shiranzada · 2023-03-20T12:36:44Z

Thanks @zhongyi-zhou, we condition on e_tgt during finetuning only for the super resolution models of Imagen. This part is not relevant for Stable Diffusion.

For the base model (Imagen-64 or LDM) we condition on e_opt during finetuning rather than on e_tgt in order to overfit the image (for e_opt)

MarkRich mentioned this pull request Oct 24, 2022

[Community Pipeline] Imagic: Text-Based Real Image Editing with Diffusion Models #895

Closed

patil-suraj self-assigned this Oct 24, 2022

patrickvonplaten reviewed Oct 26, 2022

View reviewed changes

patrickvonplaten mentioned this pull request Oct 26, 2022

[Feature Request][Community] Ability to pass text_embeddings/uncond_embeddings as arguments in pipe call #955

Closed

patil-suraj reviewed Oct 26, 2022

View reviewed changes

MarkRich force-pushed the add-imagic-to-community-pipelines branch from 7c97305 to 39d7d9e Compare October 27, 2022 17:27

patil-suraj approved these changes Oct 28, 2022

View reviewed changes

patil-suraj reviewed Oct 28, 2022

View reviewed changes

examples/community/imagic_stable_diffusion.py Outdated Show resolved Hide resolved

examples/community/README.md Outdated Show resolved Hide resolved

patrickvonplaten reviewed Oct 31, 2022

View reviewed changes

examples/community/README.md Show resolved Hide resolved

patrickvonplaten approved these changes Oct 31, 2022

View reviewed changes

MarkRich added 7 commits October 31, 2022 20:27

initial commit to add imagic to stable diffusion community pipelines

1af72fa

remove some testing changes

10349e5

comments from PR review for imagic stable diffusion

9c6bf05

remove changes from pipeline_stable_diffusion as part of imagic pipeline

983e032

clean up example code and add line back in to pipeline_stable_diffusi…

ca8e4c0

…on for imagic pipeline

remove unused functions

9af9c7f

small code quality changes for imagic pipeline

a3448a3

MarkRich force-pushed the add-imagic-to-community-pipelines branch from f499721 to a3448a3 Compare November 1, 2022 03:28

MarkRich added 3 commits October 31, 2022 20:29

clean up readme

fe912d7

remove hardcoded logging values for imagic community example

d41229d

undo change for DDIMScheduler

8d1d60c

patil-suraj merged commit a793b1f into huggingface:main Nov 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add imagic to community pipelines #958

Add imagic to community pipelines #958

MarkRich commented Oct 24, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 24, 2022 •

edited

Loading

patrickvonplaten left a comment •

edited

Loading

patil-suraj left a comment

patil-suraj Oct 26, 2022

MarkRich commented Oct 27, 2022

patil-suraj left a comment

patil-suraj left a comment

patrickvonplaten left a comment

MarkRich commented Nov 1, 2022

patil-suraj commented Nov 1, 2022

zhongyi-zhou commented Mar 14, 2023

shiranzada commented Mar 20, 2023 •

edited

Loading

Add imagic to community pipelines #958

Add imagic to community pipelines #958

Conversation

MarkRich commented Oct 24, 2022 • edited Loading

Results:

HuggingFaceDocBuilderDev commented Oct 24, 2022 • edited Loading

patrickvonplaten left a comment • edited Loading

Choose a reason for hiding this comment

patil-suraj left a comment

Choose a reason for hiding this comment

patil-suraj Oct 26, 2022

Choose a reason for hiding this comment

MarkRich commented Oct 27, 2022

patil-suraj left a comment

Choose a reason for hiding this comment

patil-suraj left a comment

Choose a reason for hiding this comment

patrickvonplaten left a comment

Choose a reason for hiding this comment

MarkRich commented Nov 1, 2022

patil-suraj commented Nov 1, 2022

zhongyi-zhou commented Mar 14, 2023

shiranzada commented Mar 20, 2023 • edited Loading

MarkRich commented Oct 24, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 24, 2022 •

edited

Loading

patrickvonplaten left a comment •

edited

Loading

shiranzada commented Mar 20, 2023 •

edited

Loading