forked from huggingface/lighteval
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
--------- Co-authored-by: Sadra Barikbin <[email protected]> Co-authored-by: Nathan Habib <[email protected]>
- Loading branch information
1 parent
2b9aacc
commit 3b48b75
Showing
10 changed files
with
55 additions
and
73 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
model: | ||
type: "endpoint" | ||
base_params: | ||
endpoint_name: "smollm-360m-instruct-v0-2-q8-lvy" # needs to be lower case without special characters | ||
model: HuggingFaceTB/SmolLM-360M-Instruct | ||
revision: "main" | ||
dtype: "default" # can be any of "awq", "eetq", "gptq", "4bit' or "8bit" (will use bitsandbytes), "bfloat16" or "float16" | ||
reuse_existing: true # if true, ignore all params in instance, and don't delete the endpoint after evaluation | ||
instance: | ||
accelerator: "gpu" | ||
region: "eu-west-1" | ||
vendor: "aws" | ||
instance_size: "medium" | ||
instance_type: "g5.2xlarge" | ||
framework: "pytorch" | ||
endpoint_type: "protected" | ||
namespace: null # The namespace under which to launch the endopint. Defaults to the current user's namespace | ||
image_url: null # Optionally specify the docker image to use when launching the endpoint model. E.g., launching models with later releases of the TGI container with support for newer models. | ||
env_vars: | ||
null # Optional environment variables to include when launching the endpoint. e.g., `MAX_INPUT_LENGTH: 2048` | ||
generation: | ||
add_special_tokens: true |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.