Fix Dropout in Fused LoRA Operations #102

fabianlim · 2024-11-06T08:22:05Z

This PR fixes #97.

update bnb and auto_gptq fused-lora to correctly account for dropout in the backward.
fix tests to correct patch model, and compute and test input gradients for test_adapter_gradients_match_with_attention_layer
unfortunately we cannot test input_grads for GPTQ in test_adapter_gradients_match_with_attention_layer, this is a limitation of unable to properly load a small GPTQ model.

~~In addition we also disable the MLP fused op for certain models, and removed the old fast_quantized_peft plugin.~~ We defer this to another PR

Regerssions

Looks quite good. The node I got is a little slower, but everything seems to be inline.



benchmarks.csv

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fabianlim added 6 commits November 5, 2024 13:08

remove skip on test now #48 is complete

c06913e

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fix fusedops test

4b78624

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fix model patching in test

6b480c3

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fix test to tail on input grads

2cdd799

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fix dropout in fused_lora

80ed420

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fmt + lint

b6a8d21

Signed-off-by: Yu Chin Fabian Lim <[email protected]>

fabianlim changed the title ~~Fix Dropout in~~ Fix Dropout in Fused LoRA Operations Nov 6, 2024

fabianlim requested a review from anhuong November 6, 2024 17:04

fabianlim changed the title ~~Fix Dropout in Fused LoRA Operations~~ Fix Dropout in Fused LoRA Operations, Dsiable MLP Fused Op for non-SwiGLU, Remove Fast Quantized Peft Nov 8, 2024

fabianlim force-pushed the fix/lora-drop branch 2 times, most recently from 263c75e to b6a8d21 Compare November 8, 2024 05:34

fabianlim changed the title ~~Fix Dropout in Fused LoRA Operations, Dsiable MLP Fused Op for non-SwiGLU, Remove Fast Quantized Peft~~ Fix Dropout in Fused LoRA Operations Nov 8, 2024

fabianlim merged commit d767e33 into main Nov 8, 2024
11 checks passed

fabianlim deleted the fix/lora-drop branch November 8, 2024 11:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Dropout in Fused LoRA Operations #102

Fix Dropout in Fused LoRA Operations #102

fabianlim commented Nov 6, 2024 •

edited

Loading

Fix Dropout in Fused LoRA Operations #102

Fix Dropout in Fused LoRA Operations #102

Conversation

fabianlim commented Nov 6, 2024 • edited Loading

Regerssions

fabianlim commented Nov 6, 2024 •

edited

Loading