Make fsdp ram efficient loading optional #2037

pacman100 · 2023-10-06T10:02:45Z

What does this PR do?

Makes FSDP RAM-efficient loading optional. Certain models have issues handling meta devices while loading pre-trained weights. Fixes #1948 (Error downloading wav2vec2 model when using accelerate launch with FSDP) and #2031 (NotImplementedError: Cannot copy out of meta tensor; no data! - while using FSDP [works with DDP and DeepSpeed]).
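As a rough illustration of what "optional" means here, the sketch below gates the loading strategy behind a boolean switch. It is not the code from this PR: the environment variable name `FSDP_CPU_RAM_EFFICIENT_LOADING`, the helper names, and the rule that only rank 0 materialises real weights while other ranks use the meta device are all assumptions made for the example.

```python
import os

# Assumed name of the switch; treat it as illustrative, not as the PR's exact API.
ENV_VAR = "FSDP_CPU_RAM_EFFICIENT_LOADING"


def ram_efficient_loading_enabled(env=None, default=True):
    """Parse the on/off switch from an environment mapping (hypothetical helper)."""
    env = os.environ if env is None else env
    raw = env.get(ENV_VAR)
    if raw is None:
        # Unset variable: fall back to the default behaviour.
        return default
    return raw.strip().lower() in {"1", "true", "yes", "y", "on"}


def loading_device(rank, env=None):
    """Pick where a rank would place pre-trained weights (hypothetical helper).

    With the option on, only rank 0 holds real weights on CPU and the other
    ranks create empty parameters on the meta device. With the option off,
    every rank loads real weights, which avoids meta-tensor copy errors at
    the cost of more CPU RAM.
    """
    if ram_efficient_loading_enabled(env) and rank != 0:
        return "meta"
    return "cpu"
```

Under these assumptions, setting the variable to `false` makes every rank load onto `cpu`, which is the workaround for models that cannot handle meta tensors.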

HuggingFaceDocBuilderDev · 2023-10-06T10:07:31Z

The documentation is not available anymore as the PR was closed or merged.

muellerzr

Thanks for the fix! I added a few suggestions to try and make a few points more clear, let's see if we can improve it some :)

src/accelerate/commands/config/cluster.py

src/accelerate/commands/launch.py

muellerzr

Thanks! cc @BenjaminBossan for a secondary :)

BenjaminBossan

Looks good, thanks Sourab. I only have two small nits, up to you if you want to address them.

src/accelerate/utils/launch.py

docs/source/usage_guides/fsdp.md

BenjaminBossan

Great, thanks for enhancing the doc of the parameter.

make fsdp ram efficient loading optional

061153a

pacman100 mentioned this pull request Oct 6, 2023

Make fsdp ram efficient loading optional huggingface/transformers#26631

Merged

pacman100 requested a review from muellerzr October 6, 2023 11:14

pacman100 marked this pull request as ready for review October 6, 2023 11:14

Add documentation

6a11bf9

muellerzr reviewed Oct 6, 2023

src/accelerate/commands/config/cluster.py (outdated, resolved)

src/accelerate/commands/launch.py (resolved)

address comments

1799fbc

muellerzr approved these changes Oct 11, 2023

muellerzr requested a review from BenjaminBossan October 11, 2023 17:03

BenjaminBossan approved these changes Oct 12, 2023

src/accelerate/utils/launch.py (outdated, resolved)

docs/source/usage_guides/fsdp.md (outdated, resolved)

address comments

6618083

BenjaminBossan reviewed Oct 12, 2023

docs/source/usage_guides/fsdp.md (outdated, resolved)

pacman100 added 2 commits October 12, 2023 20:18

address comments

8afc304

nit

50e2f7c

BenjaminBossan approved these changes Oct 12, 2023

pacman100 merged commit e6d96e5 into huggingface:main Oct 12, 2023
24 checks passed

hiroshi-matsuda-rit mentioned this pull request Oct 25, 2023

Try multi-node training llm-jp/llm-jp-sft#3

Closed

pacman100 mentioned this pull request Nov 8, 2023

NotImplementedError: Cannot copy out of meta tensor; no data! huggingface/transformers#26510

Closed

4 tasks

pacman100 deleted the smangrul/make-fsdp-ram-efficient-loading-optional branch November 16, 2023 11:53
