From 82a3dfde86e8861b7438631b656cf686755dd310 Mon Sep 17 00:00:00 2001 From: Wauplin Date: Tue, 17 Sep 2024 13:27:45 +0200 Subject: [PATCH 1/3] Add more PRO models --- docs/api-inference/supported-models.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/docs/api-inference/supported-models.md b/docs/api-inference/supported-models.md index b8c82c207..63387c138 100644 --- a/docs/api-inference/supported-models.md +++ b/docs/api-inference/supported-models.md @@ -12,12 +12,17 @@ You can find: In addition to thousands of public models available in the Hub, PRO and Enterprise users get higher [rate limits](./rate-limits) and free access to the following models: + | Model | Size | Context Length | Use | |--------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------|--------------------------------------------------------------| | Meta Llama 3.1 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | 128k tokens | High quality multilingual chat model with large context length | | Meta Llama 3 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) | 8k tokens | One of the best chat models | +| Meta Llama Guard 3 | [8B](hhttps://huggingface.co/meta-llama/Llama-Guard-3-8B) | 4k tokens | | | Llama 2 Chat | [7B](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf), [13B](https://huggingface.co/meta-llama/Llama-2-13b-chat-hf), [70B](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) | 4k tokens | One of the best conversational models | +| DeepSeek Coder v2 | [236B](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct) | 16k tokens | A model with coding capabilities. | +| Aya 23 | [8B](https://huggingface.co/CohereForAI/aya-23-8B), [35B](https://huggingface.co/CohereForAI/aya-23-35B) | 4k tokens | A model with multilingual capabilities. | +| C4AI Command R+ 08-2024 | [104B](https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024) | 32k tokens | A model with high quality tool calling capabilities. | Bark | [0.9B](https://huggingface.co/suno/bark) | - | Text to audio generation | From daf618230c8513c8e18c7c5ee6d6fced91ed2c8d Mon Sep 17 00:00:00 2001 From: Lucain Date: Tue, 17 Sep 2024 23:52:25 +0200 Subject: [PATCH 2/3] Update docs/api-inference/supported-models.md Co-authored-by: Omar Sanseviero --- docs/api-inference/supported-models.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/api-inference/supported-models.md b/docs/api-inference/supported-models.md index 63387c138..e3abffa22 100644 --- a/docs/api-inference/supported-models.md +++ b/docs/api-inference/supported-models.md @@ -18,7 +18,7 @@ In addition to thousands of public models available in the Hub, PRO and Enterpri |--------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------|--------------------------------------------------------------| | Meta Llama 3.1 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | 128k tokens | High quality multilingual chat model with large context length | | Meta Llama 3 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) | 8k tokens | One of the best chat models | -| Meta Llama Guard 3 | [8B](hhttps://huggingface.co/meta-llama/Llama-Guard-3-8B) | 4k tokens | | +| Meta Llama Guard 3 | [8B](https://huggingface.co/meta-llama/Llama-Guard-3-8B) | 4k tokens | | | Llama 2 Chat | [7B](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf), [13B](https://huggingface.co/meta-llama/Llama-2-13b-chat-hf), [70B](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) | 4k tokens | One of the best conversational models | | DeepSeek Coder v2 | [236B](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct) | 16k tokens | A model with coding capabilities. | | Aya 23 | [8B](https://huggingface.co/CohereForAI/aya-23-8B), [35B](https://huggingface.co/CohereForAI/aya-23-35B) | 4k tokens | A model with multilingual capabilities. | From e0659c54c462fcae06c015b881952fbcc2e64914 Mon Sep 17 00:00:00 2001 From: Wauplin Date: Mon, 23 Sep 2024 14:43:18 +0200 Subject: [PATCH 3/3] update --- docs/api-inference/supported-models.md | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/docs/api-inference/supported-models.md b/docs/api-inference/supported-models.md index cbbbd7368..8edaa2215 100644 --- a/docs/api-inference/supported-models.md +++ b/docs/api-inference/supported-models.md @@ -21,10 +21,9 @@ In addition to thousands of public models available in the Hub, PRO and Enterpri | Meta Llama Guard 3 | [8B](https://huggingface.co/meta-llama/Llama-Guard-3-8B) | 4k tokens | | | Llama 2 Chat | [7B](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf), [13B](https://huggingface.co/meta-llama/Llama-2-13b-chat-hf), [70B](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) | 4k tokens | One of the best conversational models | | DeepSeek Coder v2 | [236B](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct) | 16k tokens | A model with coding capabilities. | -| Aya 23 | [8B](https://huggingface.co/CohereForAI/aya-23-8B), [35B](https://huggingface.co/CohereForAI/aya-23-35B) | 4k tokens | A model with multilingual capabilities. | -| C4AI Command R+ 08-2024 | [104B](https://huggingface.co/CohereForAI/c4ai-command-r-plus-08-2024) | 32k tokens | A model with high quality tool calling capabilities. | Bark | [0.9B](https://huggingface.co/suno/bark) | - | Text to audio generation | +This list is not exhaustive and might be updated in the future. ## Running Private Models