[Core] better support offloading when side loading is enabled. #4855
Conversation
The documentation is not available anymore as the PR was closed or merged.
I think for tests, we will need to add them as SLOW tests requiring GPUs. Given the importance of these use cases, I won't mind adding them. Any objections to adding these new SLOW tests? @patrickvonplaten @williamberman
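For reference, a sketch of what such a SLOW test could look like (a non-authoritative sketch: the `slow` and `require_torch_gpu` decorators come from diffusers' testing utilities, and the checkpoint and textual-inversion repo are illustrative choices, not what the PR ultimately added):

import unittest

import torch
from diffusers import StableDiffusionPipeline
from diffusers.utils.testing_utils import require_torch_gpu, slow


@slow
@require_torch_gpu
class TextualInversionOffloadTests(unittest.TestCase):
    def test_load_textual_inversion_with_model_cpu_offload(self):
        pipe = StableDiffusionPipeline.from_pretrained(
            "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
        )
        pipe.enable_model_cpu_offload()
        # Side loading should remove and then re-apply the offload hooks.
        pipe.load_textual_inversion("sd-concepts-library/cat-toy")
        image = pipe("A <cat-toy> on a table", num_inference_steps=2).images[0]
        self.assertIsNotNone(image)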
src/diffusers/loaders.py
Outdated
# Remove any existing hooks.
is_model_cpu_offload = False
is_sequential_cpu_offload = False
for _, component in self.components.items():
    if isinstance(component, nn.Module):
        if hasattr(component, "_hf_hook"):
            is_model_cpu_offload = isinstance(getattr(component, "_hf_hook"), CpuOffload)
            is_sequential_cpu_offload = isinstance(getattr(component, "_hf_hook"), AlignDevicesHook)
            logger.info(
                "Accelerate hooks detected. Since you have called `load_textual_inversion()`, the previous hooks will be first removed. Then the textual inversion parameters will be loaded and the hooks will be applied again."
            )
            remove_hook_from_module(component)
Doesn't one of these two hook styles hook into every submodule as well, so shouldn't one of the checks be recursive?
@muellerzr to help here a bit.
You should be able to do `remove_hook_from_module(component, recurse=True)`.
CC @SunMarc too for a second glance :)
But is it required here? Sorry for not making my comment clear.
I guess trying to understand just what we're aiming to achieve (solid guess based on context, let me know if I'm accurate):
- Given a model that may be loaded in via `device_map="auto"` or some form of `device_map`
- We wish to remove the hooks when wanting to load the full model into memory / remove it from BMI (big model inference)
- With the potential of placing it back later

Is this accurate? Otherwise I may need a bit more info/context I'm missing somehow.
We want to be able to detect if a `torch.nn.Module` has hooks, and we want to remove them. That is the bit relevant to `accelerate`. Then, after loading some auxiliary weights, we want to load the appropriate hooks back in.
Let me know if that helps?
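As a rough sketch of that cycle (assuming a diffusers `DiffusionPipeline` instance and a callable that performs the actual side loading; the function name and `load_weights` parameter here are hypothetical):

import torch
from accelerate.hooks import AlignDevicesHook, CpuOffload, remove_hook_from_module


def side_load_with_hooks_restored(pipe, load_weights):
    # 1. Detect which offloading style is active and remove the hooks.
    is_model_cpu_offload = False
    is_sequential_cpu_offload = False
    for _, component in pipe.components.items():
        if isinstance(component, torch.nn.Module) and hasattr(component, "_hf_hook"):
            is_model_cpu_offload |= isinstance(component._hf_hook, CpuOffload)
            is_sequential_cpu_offload |= isinstance(component._hf_hook, AlignDevicesHook)
            remove_hook_from_module(component)

    # 2. Load the auxiliary weights (textual inversion / LoRA) while no hooks are attached.
    load_weights()

    # 3. Re-apply whichever offloading was previously enabled.
    if is_model_cpu_offload:
        pipe.enable_model_cpu_offload()
    elif is_sequential_cpu_offload:
        pipe.enable_sequential_cpu_offload()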
From what I understood from the codebase, if we have `is_sequential_cpu_offload`, it means that the components were offloaded using `cpu_offload`, which recursively places the hooks on each submodule. In the case of `is_model_cpu_offload`, we use `cpu_offload_with_hook`, which places only one hook on the module, so that the entire model will be offloaded when another hook is triggered.
I would then suggest using `remove_hook_from_module(component, recurse=True)` for the first case and `remove_hook_from_module(component, recurse=False)` for the second case, if you don't want to just recursively remove all the hooks in both cases!
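A minimal sketch of that suggestion (the hook classes and `remove_hook_from_module` are accelerate's; the wrapper function name is hypothetical):

import torch
from accelerate.hooks import AlignDevicesHook, CpuOffload, remove_hook_from_module


def remove_offload_hooks(component: torch.nn.Module) -> None:
    hook = getattr(component, "_hf_hook", None)
    if isinstance(hook, AlignDevicesHook):
        # cpu_offload() attaches AlignDevicesHook to every submodule, so recurse.
        remove_hook_from_module(component, recurse=True)
    elif isinstance(hook, CpuOffload):
        # cpu_offload_with_hook() attaches a single top-level hook.
        remove_hook_from_module(component, recurse=False)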
logger.info(
    "Accelerate hooks detected. Since you have called `load_lora_weights()`, the previous hooks will be first removed. Then the LoRA parameters will be loaded and the hooks will be applied again."
)
I think the log statement might be a bit noisy. It'd be nice if we expected the user to do additional things with the placed accelerate hooks, or needed them to be aware that some state they expected to be maintained might not be, but we definitely don't want the user to touch the hooks.
I think it's relatively simple given the context the message is being raised from. If you have a better suggestion, let me know.
Sorry, I think my main point is that the log is a bit noisy given that it leaks what is supposed to be an internal implementation detail. It's not really something that should be exposed to an end user.
Design makes sense! A few quick questions.
if isinstance(component, nn.Module):
    if hasattr(component, "_hf_hook"):
        is_model_cpu_offload = isinstance(getattr(component, "_hf_hook"), CpuOffload)
        is_sequential_cpu_offload = isinstance(getattr(component, "_hf_hook"), AlignDevicesHook)
nice!
Looks good to me!
@patrickvonplaten ready for another round of review.
Good to go for me!
[Core] better support offloading when side loading is enabled. (huggingface#4855)
* better support offloading when side loading is enabled.
* load_textual_inversion
* better messaging for textual inversion.
* fixes
* address PR feedback.
* sdxl support.
* improve messaging
* recursive removal when cpu sequential offloading is enabled.
* add: lora tests
* recruse.
* add: offload tests for textual inversion.
Revert "[Core] better support offloading when side loading is enabled. (huggingface#4855)" (huggingface#4927). This reverts commit e4b8e79.