support to disable exllama for gptq #604

winglian · 2023-09-19T13:05:21Z

fixes #599

adding gptq_disable_exllama: true to the yml config should fix the issue with gptq

NanoCode012 · 2023-09-19T13:52:00Z

The author of the linked issue mentions

    raise ValueError('Expected a cuda device, but got: {}'.format(device))
ValueError: Expected a cuda device, but got: cpu

after setting it. Is that due to cuda oom?

Napuh · 2023-09-19T15:09:38Z

Executing the example yaml file in this branch thows an error related to modifying the LlamaConfig object:

  File "/home/axolotl/scripts/finetune.py", line 52, in <module>
    fire.Fire(do_cli)
  File "/opt/conda/lib/python3.10/site-packages/fire/core.py", line 141, in Fire
    component_trace = _Fire(component, args, parsed_flag_args, context, name)
  File "/opt/conda/lib/python3.10/site-packages/fire/core.py", line 475, in _Fire
    component, remaining_args = _CallAndUpdateTrace(
  File "/opt/conda/lib/python3.10/site-packages/fire/core.py", line 691, in _CallAndUpdateTrace
    component = fn(*varargs, **kwargs)
  File "/home/axolotl/scripts/finetune.py", line 48, in do_cli
    train(cfg=parsed_cfg, cli_args=parsed_cli_args, dataset_meta=dataset_meta)
  File "/home/axolotl/src/axolotl/train.py", line 58, in train
    model, peft_config = load_model(cfg, tokenizer, inference=cli_args.inference)
  File "/home/axolotl/src/axolotl/utils/models.py", line 202, in load_model
    model_config["disable_exllama"] = cfg.gptq_disable_exllama
TypeError: 'LlamaConfig' object does not support item assignment
Traceback (most recent call last):
  File "/opt/conda/bin/accelerate", line 8, in <module>
    sys.exit(main())
  File "/opt/conda/lib/python3.10/site-packages/accelerate/commands/accelerate_cli.py", line 47, in main
    args.func(args)
  File "/opt/conda/lib/python3.10/site-packages/accelerate/commands/launch.py", line 986, in launch_command
    simple_launcher(args)
  File "/opt/conda/lib/python3.10/site-packages/accelerate/commands/launch.py", line 628, in simple_launcher
    raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd)
subprocess.CalledProcessError: Command '['/opt/conda/bin/python', 'scripts/finetune.py', 'examples/llama-2/gptq-lora.yml']' returned non-zero exit status 1.

Also:

The author of the linked issue mentions
    raise ValueError('Expected a cuda device, but got: {}'.format(device))
ValueError: Expected a cuda device, but got: cpu
after setting it. Is that due to cuda oom?

I doubt is a CUDA memory issue, I was executing on a RTX3090 24GB. Wasn't a fluke either, as the error persisted in two different machines.

winglian · 2023-09-19T19:50:29Z

@Napuh I updated the fix, lmk if that works.

Napuh · 2023-09-19T20:11:33Z

Now it's throwing the same error as in #599, ValueError: Found modules on cpu/disk. Using Exllama backend requires all the modules to be on GPU.You can deactivate exllama backend by setting `disable_exllama=True` in the quantization config object.

Im executing accelerate launch scripts/finetune.py examples/llama-2/gptq-lora.yml, am I doing something wrong?

Also script keeps loading after model downloading for about 5 minutes, and no memory is ever allocated on the gpu (monitored manually via nvidia-smi).

winglian · 2023-09-19T20:39:46Z

@Napuh hopefully this most recent commit resolves it.

Napuh · 2023-09-19T21:11:49Z

@winglian Latest commit solves the config issue with exllama, but now it's throwing ValueError: Expected a cuda device, but got: cpu, the same error as if I edit the config.json manually.

#456 may be related as it's the same error, but the scenario is different from this one.

winglian · 2023-09-19T21:50:57Z

@winglian Latest commit solves the config issue with exllama, but now it's throwing ValueError: Expected a cuda device, but got: cpu, the same error as if I edit the config.json manually.

#456 may be related as it's the same error, but the scenario is different from this one.

#609 should fix the issue for the device check when logging gpu utilization

* support to disable exllama for gptq * update property instead of item * fix config key

support to disable exllama for gptq

91294cf

winglian requested a review from NanoCode012 September 19, 2023 13:05

NanoCode012 approved these changes Sep 19, 2023

View reviewed changes

update property instead of item

a08404b

fix config key

1e38c56

winglian merged commit faecff9 into main Sep 19, 2023
4 checks passed

winglian deleted the gptq-disable-exllama branch September 19, 2023 21:51

mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023

support to disable exllama for gptq (axolotl-ai-cloud#604)

7f2b780

* support to disable exllama for gptq * update property instead of item * fix config key

djsaunde pushed a commit that referenced this pull request Dec 17, 2024

support to disable exllama for gptq (#604)

90ea67e

* support to disable exllama for gptq * update property instead of item * fix config key

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support to disable exllama for gptq #604

support to disable exllama for gptq #604

winglian commented Sep 19, 2023

NanoCode012 commented Sep 19, 2023 •

edited

Loading

Napuh commented Sep 19, 2023 •

edited

Loading

winglian commented Sep 19, 2023

Napuh commented Sep 19, 2023 •

edited

Loading

winglian commented Sep 19, 2023

Napuh commented Sep 19, 2023 •

edited

Loading

winglian commented Sep 19, 2023

support to disable exllama for gptq #604

support to disable exllama for gptq #604

Conversation

winglian commented Sep 19, 2023

NanoCode012 commented Sep 19, 2023 • edited Loading

Napuh commented Sep 19, 2023 • edited Loading

winglian commented Sep 19, 2023

Napuh commented Sep 19, 2023 • edited Loading

winglian commented Sep 19, 2023

Napuh commented Sep 19, 2023 • edited Loading

winglian commented Sep 19, 2023

NanoCode012 commented Sep 19, 2023 •

edited

Loading

Napuh commented Sep 19, 2023 •

edited

Loading

Napuh commented Sep 19, 2023 •

edited

Loading

Napuh commented Sep 19, 2023 •

edited

Loading