Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Garbage Collector OOM get OOM Killed #1049

Open
sebgoa opened this issue Oct 18, 2024 · 0 comments
Open

Garbage Collector OOM get OOM Killed #1049

sebgoa opened this issue Oct 18, 2024 · 0 comments

Comments

@sebgoa
Copy link

sebgoa commented Oct 18, 2024

Hi,

On a cluster with over 400 nodes, the gpu operator node feature discovery garbage collector gets OOM killed.
It has a default of 1Gi memory limit.

What is the rule of thumb to size the memory limit appropriately ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant