Describe the bug

Running Serverless HF Inference as instructed in the documentation fails:

```
File "/home/tobias/src/tamingLLMs/tamingllms/.venv/lib/python3.12/site-packages/lighteval/models/tgi_model.py", line 70, in __init__
    model_precision = self.model_info["model_dtype"]
                      ~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^
KeyError: 'model_dtype'
```
Next, if you manually set `model_precision = "float16"` in lighteval/models/tgi_model.py, line 70, you get a second error:
```
ValueError: batch_size should be a positive integer value, but got batch_size=-1
```
which is then resolved if you pass `--override_batch_size 1` to the lighteval command.
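For context, the manual edit described above amounts to replacing the hard key lookup with a defaulted one. A minimal sketch of that local patch, assuming `self.model_info` is the metadata dict returned by the serverless endpoint (the `"float16"` default is an arbitrary choice, not lighteval's actual fix):

```python
# Local workaround sketch for lighteval/models/tgi_model.py, line 70.
# Assumption: self.model_info is the info dict returned by the serverless
# Inference API, which here does not contain a "model_dtype" entry.
model_precision = self.model_info.get("model_dtype", "float16")
```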
To Reproduce

```
lighteval accelerate --model_config_path="endpoint_model.yaml" --tasks "leaderboard|mmlu:econometrics|0|0" --output_dir="./evals/"
```
endpoint_model.yaml:

```yaml
model:
  type: "tgi"
  instance:
    inference_server_address: "https://api-inference.huggingface.co/models/Qwen/Qwen2.5-Math-1.5B-Instruct"
    inference_server_auth: "<API-KEY>"
    model_id: null
```
Expected behavior

Run benchmark using Serverless HF Inference.
Version info

```
python = "^3.11"
lighteval = {extras = ["accelerate"], version = "^0.6.2"}
```
Other Suggestions

The user keeps getting "model is currently loading". One recommendation would be to leverage the `wait_for_model` param to avoid breaking the command-line call if the model is still loading. See the sketch below.
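For reference, the serverless Inference API accepts a `wait_for_model` flag in the `options` field of the request payload. A minimal sketch of a call that waits for the model instead of failing (the prompt is illustrative; the endpoint and API key placeholder are taken from the config above):

```python
import requests

API_URL = "https://api-inference.huggingface.co/models/Qwen/Qwen2.5-Math-1.5B-Instruct"
headers = {"Authorization": "Bearer <API-KEY>"}

# wait_for_model=True asks the serverless API to hold the request until the
# model is loaded, rather than returning a "model is currently loading" error.
payload = {
    "inputs": "What is 2 + 2?",
    "options": {"wait_for_model": True},
}

response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```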
Hi! Can you try with the code of #445?