Update dpo_llama2.py to fix 8-bit multi-gpu training bug #1352

fancyerii · 2024-02-22T12:31:44Z

fix "ValueError: You can't train a model that has been loaded in 8-bit precision on a different device than the one you're training on." see #1348

fix "ValueError: You can't train a model that has been loaded in 8-bit precision on a different device than the one you're training on." see huggingface#1348

younesbelkada

Thanks a lot LGTM with one comment - what do you think?

examples/research_projects/stack_llama_2/scripts/dpo_llama2.py

import Accelerator

Update dpo_llama2.py

c91a228

fix "ValueError: You can't train a model that has been loaded in 8-bit precision on a different device than the one you're training on." see huggingface#1348

younesbelkada reviewed Feb 22, 2024

View reviewed changes

examples/research_projects/stack_llama_2/scripts/dpo_llama2.py Show resolved Hide resolved

Update dpo_llama2.py

75e51a6

import Accelerator

fancyerii closed this Feb 22, 2024

fancyerii deleted the fancyerii-patch-1 branch February 22, 2024 13:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update dpo_llama2.py to fix 8-bit multi-gpu training bug #1352

Update dpo_llama2.py to fix 8-bit multi-gpu training bug #1352

fancyerii commented Feb 22, 2024

younesbelkada left a comment

Update dpo_llama2.py to fix 8-bit multi-gpu training bug #1352

Update dpo_llama2.py to fix 8-bit multi-gpu training bug #1352

Conversation

fancyerii commented Feb 22, 2024

younesbelkada left a comment

Choose a reason for hiding this comment