Test inference endpoint model config parsing from path #434

albertvillanova · 2024-12-11T10:25:05Z

Test inference endpoint model config parsing from path.

As a follow-up of this PR:

Fix ignored reuse_existing in config file #431

this PR implements a test of the parsing of the config file for evaluation in inference endpoints.

To do so, the parsing is factorized into a new method:

InferenceEndpointModelConfig.from_path(model_config_path)

Additionally, a new examples/model_configs/endpoint_model_reuse_existing.yaml is added.

Findings:

In the examples/model_configs/endpoint_model.yaml, the field generation.add_special_token is ignored
- This PR deletes this field
There is another naming misalignment: dtype in the config file is named as model_dtype InferenceEndpointModel attribute

Question: should we rename it?

I would suggest aligning both names by replacing the dtype field in the config file with model_dtype
- Note that:
  - it already exists model_name
  - and there are also instance fields called instance_type and instance_size

HuggingFaceDocBuilderDev · 2024-12-11T10:27:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

clefourrier · 2024-12-11T15:14:31Z

examples/model_configs/endpoint_model_reuse_existing.yaml

+  base_params:
+    # Pass either model_name, or endpoint_name and true reuse_existing
+     endpoint_name: "llama-2-7B-lighteval" # needs to be lower case without special characters
+     reuse_existing: true # defaults to false; if true, ignore all params in instance, and don't delete the endpoint after evaluation


"and does not"

clefourrier · 2024-12-11T15:15:43Z

src/lighteval/models/endpoints/endpoint_model.py

+        config["base_params"]["model_dtype"] = config["base_params"].pop("dtype", None)
+        return cls(**config["base_params"], **config.get("instance", {}))


Nice and much cleaner! We might have to add this to all model types

albertvillanova · 2024-12-12T08:07:42Z

@clefourrier I am not sure if you agree in the renaming of the dtype field in the config file to model_dtype: #434

We could address that in a subsequent PR

Implement TGI model config from path: ```python TGIModelConfig.from_path(model_config_path) ``` Follow-up to: - #434 Related to: - #439

albertvillanova added 9 commits December 11, 2024 10:46

Add example model config for existing endpoint

47f6a6e

Test InferenceEndpointModelConfig.from_path

10d93aa

Comment default main branch in example

4c8e01a

Fix typo

b0c82b8

Delete unused add_special_tokens param in endpoint example config

e3ffecc

Fix typo

30a7928

Implement InferenceEndpointModelConfig.from_path

1f5b589

Use InferenceEndpointModelConfig.from_path

6ac7667

Refactor InferenceEndpointModelConfig.from_path

e9ff0c6

albertvillanova added 2 commits December 11, 2024 11:30

Align docs

8f8927c

Merge branch 'main' into test-inference-endpoint-model-config-from-path

ade22cf

clefourrier approved these changes Dec 11, 2024

View reviewed changes

Merge branch 'main' into test-inference-endpoint-model-config-from-path

da26699

albertvillanova merged commit f907a34 into huggingface:main Dec 12, 2024
3 checks passed

This was referenced Dec 12, 2024

[FT] Align parameter names in config files and config classes #439

Open

Implement TGI model config from path #448

Merged

NathanHB pushed a commit that referenced this pull request Dec 17, 2024

Implement TGI model config from path (#448)

1b9e2c3

Implement TGI model config from path: ```python TGIModelConfig.from_path(model_config_path) ``` Follow-up to: - #434 Related to: - #439

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test inference endpoint model config parsing from path #434

Test inference endpoint model config parsing from path #434

albertvillanova commented Dec 11, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 11, 2024

clefourrier Dec 11, 2024

clefourrier Dec 11, 2024

albertvillanova commented Dec 12, 2024 •

edited

Loading

		config["base_params"]["model_dtype"] = config["base_params"].pop("dtype", None)
		return cls(config["base_params"], config.get("instance", {}))

Test inference endpoint model config parsing from path #434

Test inference endpoint model config parsing from path #434

Conversation

albertvillanova commented Dec 11, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Dec 11, 2024

clefourrier Dec 11, 2024

Choose a reason for hiding this comment

clefourrier Dec 11, 2024

Choose a reason for hiding this comment

albertvillanova commented Dec 12, 2024 • edited Loading

albertvillanova commented Dec 11, 2024 •

edited

Loading

albertvillanova commented Dec 12, 2024 •

edited

Loading