Description
This PR adds a new container for TEI v1.6.0, which was just released (see the release notes at https://github.com/huggingface/text-embeddings-inference/releases/tag/v1.6.0).
The main feature in TEI v1.6.0 compared to TEI v1.5.0 is support for multiple CPU backends beyond ONNX, so embedding models can now be served on CPU even when a model on the Hub does not ship an ONNX-converted version of its weights. Other additions include the General Text Embeddings (GTE) heads, an MPNet implementation, fixes around the health checks, and more. A minimal usage sketch for the CPU container follows.
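The sketch below shows one way to query a locally running TEI v1.6.0 CPU container through its `/embed` endpoint; the image tag, port mapping, and model ID in the comment are assumptions for illustration, not part of this PR.

```python
# Minimal sketch, assuming a TEI v1.6.0 CPU container is already running locally
# on port 8080, e.g. started with something like (image tag and model ID are
# illustrative assumptions):
#   docker run -p 8080:80 ghcr.io/huggingface/text-embeddings-inference:cpu-1.6 \
#       --model-id sentence-transformers/all-MiniLM-L6-v2
import requests

response = requests.post(
    "http://localhost:8080/embed",
    json={"inputs": "What is Deep Learning?"},
)
response.raise_for_status()

# TEI returns a list of embedding vectors, one per input string.
embeddings = response.json()
print(len(embeddings[0]))  # dimensionality of the first embedding
```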
Note
This PR also includes the changes from the https://github.com/huggingface/text-embeddings-inference/releases/tag/v1.5.1 release.
To inspect the changes required to make the TEI container work in GCP, see the diff at: