k8s pod ,After running for a while, the GPU cannot be found in the pod. Failed to initialize NVML: Unknown Error #981
Labels
lifecycle/stale
Denotes an issue or PR has remained open with no activity and has become stale.
environmental
k8s
k8s .123.7
docker
containerd
device-plugin
exec pod
kubelet
kubelet 's cpuManagerPolicy is static
root@g007:/var/lib/kubelet# cat /var/lib/kubelet/config.yaml | grep cpuManagerPolicy cpuManagerPolicy: static
The text was updated successfully, but these errors were encountered: