How to use two models in same inference code #2991
Unanswered · siddhantwaghjale asked this question in Q&A
Replies: 1 comment
-
I don't think this is currently feasible with vLLM (please correct me if I'm wrong). But you can try searching for "LLM gateway" on GitHub.
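A common workaround is to give each model its own operating-system process, so each one initializes its tensor-parallel state from scratch. Below is a minimal sketch of that pattern. The commented-out `LLM`/`SamplingParams` usage is an assumption about vLLM's Python API and requires a GPU; a stand-in `print` is used so the sketch runs anywhere:

```python
import subprocess
import sys

def run_in_fresh_process(code: str) -> str:
    """Run a snippet in a brand-new Python interpreter, so vLLM's
    tensor-parallel group is initialized at most once per process."""
    result = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()

# In a real script, each snippet would load a different model, e.g.
# (hypothetical vLLM usage -- adjust to your vLLM version):
#   from vllm import LLM, SamplingParams
#   llm = LLM(model="facebook/opt-125m")
#   out = llm.generate(["Hello"], SamplingParams(max_tokens=8))
#   print(out[0].outputs[0].text)
# Stand-ins are used here so the sketch runs without a GPU:
snippet_a = 'print("output from model A")'
snippet_b = 'print("output from model B")'

print(run_in_fresh_process(snippet_a))  # each call gets a clean process
print(run_in_fresh_process(snippet_b))
```

Alternatively, each model can be served from its own vLLM server process and queried over HTTP, which is effectively what the "LLM gateway" projects mentioned above coordinate for you.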
-
I'm trying to run inference with two models in the same script using vLLM, but loading the second model fails with this error:
AssertionError: tensor model parallel group is already initialized.
Any help would be appreciated.