Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deepseek #686

Open
ehartford opened this issue Dec 30, 2024 · 2 comments
Open

Deepseek #686

ehartford opened this issue Dec 30, 2024 · 2 comments

Comments

@ehartford
Copy link

Requesting support for Deepseek-V3 and DeepSeek-V2.5-1210

@ehartford
Copy link
Author

ehartford commented Dec 30, 2024

With Deepseek-v3 I get this:

from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = '/home/ubuntu/datasets/models/DeepSeek-V3-bf16'
quant_path = '/home/ubuntu/datasets/models/DeepSeek-V3-awq'
quant_config = { "zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM" }

# Load model
model = AutoAWQForCausalLM.from_pretrained(model_path, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Quantize
model.quantize(tokenizer, quant_config=quant_config)

# Save quantized model
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)

print(f'Model is quantized and saved at "{quant_path}"')
python quant.py 
Traceback (most recent call last):
  File "/home/ubuntu/quant.py", line 9, in <module>
    model = AutoAWQForCausalLM.from_pretrained(model_path, trust_remote_code=True)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/miniconda3/envs/awq/lib/python3.11/site-packages/awq/models/auto.py", line 73, in from_pretrained
    model_type = check_and_get_model_type(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/miniconda3/envs/awq/lib/python3.11/site-packages/awq/models/auto.py", line 49, in check_and_get_model_type
    raise TypeError(f"{config.model_type} isn't supported yet.")
TypeError: deepseek_v3 isn't supported yet.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant