[Feature Request] Affinity of a model to a certain GPU #135

charleswg · 2024-12-03T18:22:25Z

I wonder if there's any reason why we cannot just set an affinity of a model to a GPU if we have multiple? That to me would simplify it as if I'm running multiple models, they are not both split into for example 2 GPUs. I understand setting it to auto to split into all GPU would simplify user experience but it'd be good to have that option so that model weights don't need to be splited between multiple GPUs unnecessary and no communication necessary between GPUs. Another option for example if I have two GPU with one having more VRAM than the other, it'd be wise to put the larger model on the GPU with more VRAM than just default to splitting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Affinity of a model to a certain GPU #135

[Feature Request] Affinity of a model to a certain GPU #135

charleswg commented Dec 3, 2024 •

edited

Loading

[Feature Request] Affinity of a model to a certain GPU #135

[Feature Request] Affinity of a model to a certain GPU #135

Comments

charleswg commented Dec 3, 2024 • edited Loading

charleswg commented Dec 3, 2024 •

edited

Loading