What cause the runtime error nvmlDeviceGetNvLinkRemoteDeviceType_ #6533
Unanswered
HanweiLi-cqu
asked this question in
Q&A
Replies: 2 comments 1 reply
-
by the way, my environment as follows:
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Do you solve this? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
my code:
when I set the
tensor_parallel_size=8
, get the runtime errors:I thought it might be a problem with memory usage, so I tried to use a different combination of parameters, and the result was as follows:
I'm curious as to why this is the case. I'm a beginner to VLLM, is there any optimization method of VLLM that causes this to happen?
Beta Was this translation helpful? Give feedback.
All reactions