Does vLLM support deploying models on vCUDA? #9691
Unanswered · Gaohang0804 asked this question in Q&A · 0 replies
I'm trying to deploy the InternVL2 model with vLLM on k8s. I have succeeded on a single A800 80 GB GPU, but because my model needs relatively little memory, a lot of GPU memory is wasted.
I'm wondering whether I could deploy my model on virtualized CUDA (vCUDA) and request less GPU memory for a single k8s pod.
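As context for where the "wasted" memory comes from: vLLM pre-allocates most of the device for its KV cache (90% by default), which is controlled by the `gpu_memory_utilization` setting. Below is a minimal sketch, not an answer to the vCUDA question itself, showing how that reservation could be reduced so the engine leaves headroom for other pods sharing the GPU. The model id, utilization fraction, and context length are illustrative assumptions, not values from this discussion.

```python
from vllm import LLM, SamplingParams

# Sketch: shrink vLLM's GPU memory reservation so a small model does not
# claim the whole 80 GB card. All concrete values below are assumptions.
llm = LLM(
    model="OpenGVLab/InternVL2-8B",   # assumed checkpoint; substitute your own
    trust_remote_code=True,           # InternVL2 ships custom modeling code
    gpu_memory_utilization=0.3,       # reserve ~30% of the GPU instead of the 90% default
    max_model_len=4096,               # assumed context length, keeps the KV cache small
)

# Text-only smoke test; real InternVL2 usage would also pass image inputs.
outputs = llm.generate(["Hello, who are you?"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```

Whether a pod can then be scheduled with a fractional GPU depends on the cluster's GPU sharing or vGPU solution, which is outside what vLLM itself controls.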