LORA + FSDP: ValueError: FlatParameter requires uniform requires_grad
#414
Comments
Hi @minionKP. We do not yet have LoRA working with FSDP; to use LoRA, use DDP. We expect to add support eventually, but do not have a solid timeline.
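For reference, a minimal sketch of the suggested workaround (LoRA under DDP rather than FSDP), assuming Hugging Face PEFT is used for the adapters. The model name, `target_modules`, and LoRA hyperparameters below are placeholders, not values taken from this issue.

```python
# Sketch: LoRA + DDP. Assumes PEFT for the adapters; model name and LoRA
# hyperparameters are illustrative placeholders.
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

dist.init_process_group(backend="nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b", trust_remote_code=True
).to(local_rank)

# get_peft_model freezes the base weights and adds trainable LoRA adapters.
# "Wqkv" is the attention projection name used by MPT models (an assumption here).
lora_cfg = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05, target_modules=["Wqkv"])
model = get_peft_model(model, lora_cfg)

# DDP replicates the full model on each rank, so a mix of frozen and trainable
# parameters is not a problem the way it is for FSDP's FlatParameter.
ddp_model = DDP(model, device_ids=[local_rank])
```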
I want to do LoRA on the MPT-30B model. From another issue (#413) it looks like QLoRA or doing
I think this issue arises not only with LoRA but with any mixed setup of frozen and unfrozen parameters. Is there any solution in sight? I know one could write a custom auto-wrap policy when using Lightning or pure PyTorch, but that seems complicated, especially because it creates non-uniformly sized FSDP units :-/
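As an illustration of the custom auto-wrap policy mentioned above, here is a minimal sketch in plain PyTorch: each module whose direct parameters agree on `requires_grad` becomes its own FSDP unit, so every FlatParameter ends up uniform. This is not the repository's code; also note that on PyTorch 2.0+ passing `use_orig_params=True` to FSDP is another way to tolerate mixed `requires_grad` within a unit.

```python
# Sketch only: wrap modules so each FlatParameter has uniform requires_grad.
import functools

import torch
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import lambda_auto_wrap_policy


def _uniform_requires_grad(module: torch.nn.Module) -> bool:
    # Wrap a module in its own FSDP unit when all of its *direct* parameters
    # share the same requires_grad flag (all trainable LoRA weights, or all
    # frozen base weights). Modules with no direct parameters are left to
    # their parent unit.
    flags = {p.requires_grad for p in module.parameters(recurse=False)}
    return len(flags) == 1


policy = functools.partial(lambda_auto_wrap_policy, lambda_fn=_uniform_requires_grad)

# `model` is assumed to already have frozen base weights and trainable LoRA adapters:
# fsdp_model = FSDP(model, auto_wrap_policy=policy)
#
# Alternatively, on PyTorch >= 2.0:
# fsdp_model = FSDP(model, use_orig_params=True)  # allows mixed requires_grad
```

The trade-off the comment points out still applies: wrapping every parameter-holding leaf module produces many small, unevenly sized FSDP units, which can hurt sharding efficiency.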
Environment
To reproduce
Steps to reproduce the behavior:
Expected behavior
FSDP + LORA fine-tuning
Config file
No other changes to example config file