Würstchen model #3849

kashif · 2023-06-22T09:19:48Z

What does this PR do?

Port the Würstchen model to diffusers lib.

Fixes #3134 #3706

src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py

…d.py Co-authored-by: Patrick von Platen <[email protected]>

…d.py

…paella

src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen.py

patrickvonplaten · 2023-09-06T12:43:55Z

src/diffusers/pipelines/auto_pipeline.py

@@ -305,8 +308,6 @@ def from_pretrained(cls, pretrained_model_or_path, **kwargs):
        use_auth_token = kwargs.pop("use_auth_token", None)
        local_files_only = kwargs.pop("local_files_only", False)
        revision = kwargs.pop("revision", None)
-        subfolder = kwargs.pop("subfolder", None)


Ideally this should have been fixed in another PR, but doing it here now instead. DiffusionPipeline never consumes neither subfolder nor user_agent so we shouldn't add it to the Auto... pipelines cc @yiyixuxu just FYI

patrickvonplaten · 2023-09-06T14:15:43Z

@sayakpaul I'm merging the PR now - I think all your comments were considered now.

* initial * initial * added initial convert script for paella vqmodel * initial wuerstchen pipeline * add LayerNorm2d * added modules * fix typo * use model_v2 * embed clip caption amd negative_caption * fixed name of var * initial modules in one place * WuerstchenPriorPipeline * inital shape * initial denoising prior loop * fix output * add WuerstchenPriorPipeline to __init__.py * use the noise ratio in the Prior * try to save pipeline * save_pretrained working * Few additions * add _execution_device * shape is int * fix batch size * fix shape of ratio * fix shape of ratio * fix output dataclass * tests folder * fix formatting * fix float16 + started with generator * Update pipeline_wuerstchen.py * removed vqgan code * add WuerstchenGeneratorPipeline * fix WuerstchenGeneratorPipeline * fix docstrings * fix imports * convert generator pipeline * fix convert * Work on Generator Pipeline. WIP * Pipeline works with our diffuzz code * apply scale factor * removed vqgan.py * use cosine schedule * redo the denoising loop * Update src/diffusers/models/resnet.py Co-authored-by: Patrick von Platen <[email protected]> * use torch.lerp * use warp-diffusion org * clip_sample=False, * some refactoring * use model_v3_stage_c * c_cond size * use clip-bigG * allow stage b clip to be None * add dummy * würstchen scheduler * minor changes * set clip=None in the pipeline * fix attention mask * add attention_masks to text_encoder * make fix-copies * add back clip * add text_encoder * gen_text_encoder and tokenizer * fix import * updated pipeline test * undo changes to pipeline test * nip * fix typo * fix output name * set guidance_scale=0 and remove diffuze * fix doc strings * make style * nip * removed unused * initial docs * rename * toc * cleanup * remvoe test script * fix-copies * fix multi images * remove dup * remove unused modules * undo changes for debugging * no new line * remove dup conversion script * fix doc string * cleanup * pass default args * dup permute * fix some tests * fix prepare_latents * move Prior class to modules * offload only the text encoder and vqgan * fix resolution calculation for prior * nip * removed testing script * fix shape * fix argument to set_timesteps * do not change .gitignore * fix resolution calculations + readme * resolution calculation fix + readme * small fixes * Add combined pipeline * rename generator -> decoder * Update .gitignore Co-authored-by: Patrick von Platen <[email protected]> * removed efficient_net * create combined WuerstchenPipeline * make arguments consistent with VQ model * fix var names * no need to return text_encoder_hidden_states * add latent_dim_scale to config * split model into its own file * add WuerschenPipeline to docs * remove unused latent_size * register latent_dim_scale * update script * update docstring * use Attention preprocessor * concat with normed input * fix-copies * add docs * fix test * fix style * add to cpu_offloaded_model * updated type * remove 1-line func * updated type * initial decoder test * formatting * formatting * fix autodoc link * num_inference_steps is int * remove comments * fix example in docs * Update src/diffusers/pipelines/wuerstchen/diffnext.py Co-authored-by: Patrick von Platen <[email protected]> * rename layernorm to WuerstchenLayerNorm * rename DiffNext to WuerstchenDiffNeXt * added comment about MixingResidualBlock * move paella vq-vae to pipelines' folder * initial decoder test * increased test_float16_inference expected diff * self_attn is always true * more passing decoder tests * batch image_embeds * fix failing tests * set the correct dtype * relax inference test * update prior * added combined pipeline test * faster test * faster test * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <[email protected]> * fix issues from review * update wuerstchen.md + change generator name * resolve issues * fix copied from usage and add back batch_size * fix API * fix arguments * fix combined test * Added timesteps argument + fixes * Update tests/pipelines/test_pipelines_common.py Co-authored-by: Patrick von Platen <[email protected]> * Update tests/pipelines/wuerstchen/test_wuerstchen_prior.py Co-authored-by: Patrick von Platen <[email protected]> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <[email protected]> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <[email protected]> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <[email protected]> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py * up * Fix more * failing tests * up * up * correct naming * correct docs * correct docs * fix test params * correct docs * fix classifier free guidance * fix classifier free guidance * fix more * fix all * make tests faster --------- Co-authored-by: Dominic Rampas <[email protected]> Co-authored-by: Patrick von Platen <[email protected]> Co-authored-by: Dominic Rampas <[email protected]>

kashif added 4 commits June 21, 2023 09:43

initial

119a451

initial

0623199

added initial convert script for paella vqmodel

8a6a92c

initial wuerstchen pipeline

80713a4

kashif marked this pull request as draft June 22, 2023 09:20

kashif and others added 25 commits June 22, 2023 12:08

add LayerNorm2d

8bd6cb8

added modules

806ed12

fix typo

ff6139d

use model_v2

560da3b

embed clip caption amd negative_caption

3acc9fa

fixed name of var

f84ac09

initial modules in one place

25de2c6

WuerstchenPriorPipeline

30e41a5

inital shape

d563218

initial denoising prior loop

d328459

fix output

f0cc379

add WuerstchenPriorPipeline to __init__.py

4c8a791

use the noise ratio in the Prior

ad474b1

try to save pipeline

a79a9ad

save_pretrained working

4c28f9c

Few additions

6e51d7e

Merge branch 'main' into paella

623c1e4

add _execution_device

cd5ad04

shape is int

92c46df

fix batch size

4665e48

fix shape of ratio

58c98b1

fix shape of ratio

66cff25

fix output dataclass

d06276d

tests folder

95eb11e

Merge branch 'main' into paella

2976dd8

patrickvonplaten reviewed Sep 6, 2023

View reviewed changes

src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Outdated Show resolved Hide resolved

kashif and others added 17 commits September 6, 2023 11:34

Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combine…

a09b4ef

…d.py Co-authored-by: Patrick von Platen <[email protected]>

Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combine…

1c568ce

…d.py Co-authored-by: Patrick von Platen <[email protected]>

Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combine…

500cb6e

…d.py

Merge branch 'main' of https://github.com/huggingface/diffusers into …

692a1b7

…paella

Merge branch 'paella' of https://github.com/kashif/diffusers into paella

ad98baf

up

2d222ed

Fix more

d8b62f2

failing tests

d4751ab

up

3b705e8

up

879c82c

up

8490804

correct naming

88032f1

correct docs

fb33746

correct docs

6347957

fix test params

5489081

correct docs

30bc6b6

correct docs

bc8a472

patrickvonplaten reviewed Sep 6, 2023

View reviewed changes

src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen.py Outdated Show resolved Hide resolved

patrickvonplaten force-pushed the paella branch from 8a75fe6 to bc8a472 Compare September 6, 2023 12:21

patrickvonplaten added 4 commits September 6, 2023 14:27

fix classifier free guidance

09787b1

fix classifier free guidance

ed9f96a

fix more

30a86b3

fix all

3f04ada

patrickvonplaten reviewed Sep 6, 2023

View reviewed changes

make tests faster

c35f3f7

patrickvonplaten merged commit 541bb6e into huggingface:main Sep 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Würstchen model #3849

Würstchen model #3849

kashif commented Jun 22, 2023 •

edited

Loading

patrickvonplaten Sep 6, 2023

patrickvonplaten commented Sep 6, 2023

Würstchen model #3849

Würstchen model #3849

Conversation

kashif commented Jun 22, 2023 • edited Loading

What does this PR do?

patrickvonplaten Sep 6, 2023

Choose a reason for hiding this comment

patrickvonplaten commented Sep 6, 2023

kashif commented Jun 22, 2023 •

edited

Loading