Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Affinity of a model to a certain GPU #135

Open
charleswg opened this issue Dec 3, 2024 · 0 comments
Open

[Feature Request] Affinity of a model to a certain GPU #135

charleswg opened this issue Dec 3, 2024 · 0 comments

Comments

@charleswg
Copy link

charleswg commented Dec 3, 2024

I wonder if there's any reason why we cannot just set an affinity of a model to a GPU if we have multiple? That to me would simplify it as if I'm running multiple models, they are not both split into for example 2 GPUs. I understand setting it to auto to split into all GPU would simplify user experience but it'd be good to have that option so that model weights don't need to be splited between multiple GPUs unnecessary and no communication necessary between GPUs. Another option for example if I have two GPU with one having more VRAM than the other, it'd be wise to put the larger model on the GPU with more VRAM than just default to splitting.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant