
use 🧨diffusers model #1384

Closed · wants to merge 63 commits

Conversation

@keturn (Contributor) commented Nov 5, 2022

→ Moved to #1583

[Can't change the working branch of an existing PR.]


I think the plan is that we keep the public APIs in ldm.invoke.generator stable while swapping out the implementations to be diffusers-based.

That looks like it'll be primarily in the make_image methods of those Generators.

It might be possible to split things up by the different tasks (txt2img, inpainting, etc.) into separate PRs. I'd be in favor of that if it makes the PRs smaller, but I don't know yet whether it will help that much.
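The plan above can be sketched roughly as follows. The class and pipeline here are illustrative stand-ins, not InvokeAI's actual code; the point is only that make_image keeps its public signature while the body delegates to a diffusers-based pipeline:

```python
# Illustrative sketch only: Txt2Img and fake_pipeline are stand-ins.
class Txt2Img:
    def __init__(self, pipeline):
        # pipeline: any callable with a diffusers-style interface
        self.pipeline = pipeline

    def make_image(self, prompt: str, steps: int = 50):
        # public API unchanged; the implementation underneath is swapped out
        return self.pipeline(prompt, num_inference_steps=steps)

# stub standing in for a diffusers pipeline
def fake_pipeline(prompt, num_inference_steps):
    return f"image({prompt!r}, steps={num_inference_steps})"

print(Txt2Img(fake_pipeline).make_image("a cat"))
```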

Test invoke.py

Usage

Add a section to your models.yaml like this:

diffusers-1.5:
  description: Diffusers version of Stable Diffusion version 1.5
  format: diffusers
  repo_name: runwayml/stable-diffusion-v1-5

Note the format: diffusers.
The repo_name is as it appears on huggingface.co.
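A hypothetical helper (not InvokeAI code) showing how a `format: diffusers` entry might be translated into arguments for `diffusers.StableDiffusionPipeline.from_pretrained`; the float16 handling corresponds to the "honor float16 setting" item in the checklist below:

```python
# Hypothetical sketch; key names match the models.yaml section above.
def diffusers_load_args(entry: dict, use_float16: bool = False) -> dict:
    if entry.get("format") != "diffusers":
        raise ValueError("not a diffusers-format entry")
    args = {"pretrained_model_name_or_path": entry["repo_name"]}
    if use_float16:
        # in real code this would be torch_dtype=torch.float16
        args["torch_dtype"] = "float16"
    return args

entry = {
    "description": "Diffusers version of Stable Diffusion version 1.5",
    "format": "diffusers",
    "repo_name": "runwayml/stable-diffusion-v1-5",
}
print(diffusers_load_args(entry))
```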

To Do: txt2img

  • don't load redundant models! (i.e. both the ckpt and the diffusers formats)
  • allow scheduler selection
  • support extra scheduler parameters (e.g. DDIM's eta).
  • honor float16 setting for loading model
  • honor free_gpu_mem with features from 🤗 Accelerate
  • update callback users with the new signature.
    • at least done for invoke_ai_web_server. Not sure if the other instances are still in use?
  • fix prompt fragment weighting. Refer to WeightedFrozenCLIPEmbedder.
  • honor threshold
  • honor safety-checker setting
  • update models.yaml.example
  • update configure_invokeai (formerly preload_models)

To Do: img2img

  • make sure we use the correct seeded noise

waiting on upstream diffusers

To Do: inpainting

discussion thread: https://discord.com/channels/1020123559063990373/1031668022294884392

@keturn (Contributor Author) commented Nov 5, 2022

and hey, I already hit the first obstacle to using a stock diffusers pipeline: the stock pipelines take in the prompt as text, but Invoke does its own handling of the text and wants to pass in the data for the CLIP text embeddings instead.

This is fine; diffusers pretty much expects that most applications doing anything interesting will need to customize their pipeline anyway. It just means a bit more code is required to get even the basic proof of concept up.
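The customization described above can be sketched as a pipeline variant whose `__call__` accepts precomputed text embeddings rather than a prompt string, so the caller keeps its own prompt handling. All names here are illustrative stand-ins, not the diffusers API:

```python
# Stand-in sketch; not real diffusers code.
class EmbeddingPipeline:
    def __call__(self, text_embeddings, num_inference_steps: int = 50):
        # a stock pipeline would tokenize and CLIP-encode a prompt here;
        # this variant skips straight to denoising with the given embeddings
        return {"shape": len(text_embeddings), "steps": num_inference_steps}

out = EmbeddingPipeline()([0.1, 0.2, 0.3], num_inference_steps=10)
print(out)
```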

@patrickvonplaten (Contributor) commented:

Very cool to see that diffusers can be useful to serve as a backend for this library. If you need any help with the migration or require additional features, we're very open to help 🤗

@keturn (Contributor Author) commented Nov 9, 2022

Patrick, why do you say that almost like it's a surprise? 😄 Was serving as an application backend not the plan for diffusers all along? Don't make me second-guess myself here. It'll make me look bad in front of the Invoke devs! 🙈

As for what diffusers could do to help, a fine place to start would be refactoring the StableDiffusionPipeline to aid reusability and extensibility: huggingface/diffusers#551 (comment)

@keturn (Contributor Author) commented Nov 9, 2022

I've pushed a proof of concept for txt2img. It is super rough, but it does succeed in producing an image for a prompt.

I've updated this PR's main description with a checklist of things we need to do to support it for real.

@patrickvonplaten (Contributor) commented:

> I've pushed a proof of concept for txt2img. It is super rough, but it does succeed in producing an image for a prompt.
>
> I've updated this PR's main description with a checklist of things we need to do to support it for real.

Haha, that sounds good - we've started factoring out methods as done in this PR: huggingface/diffusers#1224 - the __call__ method should get cleaner bit by bit ;-)

@keturn (Contributor Author) commented Nov 10, 2022

Update: made model loading much better. made output much worse.

Like no-longer-recognizable worse. But I committed anyway because it does run, and it's so much easier to fiddle with now that it's not taking extra gigabytes of RAM.

I suspect this implementation of get_learned_conditionings:

text_fragments = c[0]
text_input = self._tokenize(text_fragments)
with torch.inference_mode():
    token_ids = text_input.input_ids.to(self.text_encoder.device)
    text_embeddings = self.text_encoder(token_ids)[0]
return text_embeddings, text_input.input_ids

but maybe it's something else, like configure_model_padding.

@keturn (Contributor Author) commented Nov 10, 2022

fixed! I didn't notice it was making 256px images instead of 512.

@keturn (Contributor Author) commented Nov 10, 2022

Added initial support for switching schedulers. Some of them look like they need further configuration.
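Scheduler selection could look roughly like this: a map from Invoke's sampler names to diffusers scheduler class names. The class names are real diffusers classes (as of diffusers ~0.8); the mapping itself is an assumption for illustration:

```python
# Assumed mapping for illustration; not InvokeAI's actual table.
SCHEDULER_MAP = {
    "ddim": "DDIMScheduler",
    "k_lms": "LMSDiscreteScheduler",
    "k_euler": "EulerDiscreteScheduler",
    "k_euler_a": "EulerAncestralDiscreteScheduler",
}

def scheduler_class_name(sampler: str) -> str:
    try:
        return SCHEDULER_MAP[sampler]
    except KeyError:
        raise ValueError(f"unknown sampler: {sampler}") from None

print(scheduler_class_name("k_lms"))
```

In real code one would then swap the scheduler in with something like `getattr(diffusers, name).from_config(pipeline.scheduler.config)`, reusing the pipeline's existing scheduler config.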

@keturn (Contributor Author) commented Nov 10, 2022

Found the missing bit. k_lms and k_euler schedulers fixed.

@keturn (Contributor Author) commented Nov 23, 2022

The current test failure seems to be the same as the failure in development rather than anything specific to this PR.

Commit messages from the branch:
  • We get to remove some code by using methods that were factored out in the base class. (# Conflicts: ldm/invoke/generator/diffusers_pipeline.py)
  • now that we can use it directly from diffusers 0.8.1
@keturn (Contributor Author) commented Nov 24, 2022

Pushed support for img2img. Seems to be working, at least with DDIM. LMS and Euler don't do so well.

Might be a few things to follow up on to get proper reproducible-with-seed results.
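Reproducible-with-seed results come down to drawing the initial noise from a generator seeded per image, so the same seed reproduces the same starting latents. Sketched here with stdlib `random` as a stand-in for torch (in torch this would be `torch.Generator(device).manual_seed(seed)` passed to `torch.randn(..., generator=g)`):

```python
# Stand-in illustration using stdlib random instead of torch.
import random

def seeded_noise(seed: int, n: int) -> list:
    rng = random.Random(seed)  # analogous to torch.Generator().manual_seed(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

# same seed yields the same starting noise
assert seeded_noise(42, 4) == seeded_noise(42, 4)
```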

@keturn mentioned this pull request Nov 27, 2022 (31 tasks)
@keturn (Contributor Author) commented Nov 27, 2022

→ Moved to #1583

[Can't change the working branch of an existing PR.]

@keturn closed this Nov 27, 2022