Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AAW Prod: Resource Utilization #2010

Open
5 tasks
Jose-Matsuda opened this issue Dec 11, 2024 · 0 comments
Open
5 tasks

AAW Prod: Resource Utilization #2010

Jose-Matsuda opened this issue Dec 11, 2024 · 0 comments
Labels
kind/epic An epic

Comments

@Jose-Matsuda
Copy link
Contributor

Similar to #1998

Track issues related to resource utilization in AAW prod in order to minimize cloud costs. This includes investigation and action items for them.

The general steps will be as follows

Investigate, we want to baseline costs, investigate what our current node usage is at, investigate daemonsets, investigate machine types that we can migrate to.
Re-size workloads. Change requests for those that have been specified (there's no need to add requests if not specified) to something that better reflects its actual usage (use grafana). This will need to be broken down per nodepool
Re-size nodepools. We need to resize nodepools to use machines that better reflect our usage, can we downsize some to say Standard_D2ds_v5?
Review Node selection logic: are pods being scheduled where they shouldn't be?
Investigate Shutting down node pools: shut down machines when not in use

Issues

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/epic An epic
Projects
None yet
Development

No branches or pull requests

1 participant