AAW Dev: Resource Utilization #1998

Open
10 tasks
Jose-Matsuda opened this issue Nov 27, 2024 · 0 comments
Labels
kind/epic An epic

Comments

Jose-Matsuda commented Nov 27, 2024

This epic tracks issues related to resource utilization in AAW dev, with the goal of minimizing cloud costs. It covers both the investigation work and the action items that come out of it.

The teams we need to involve, or have already involved, are:
FinOps (I have reached out to them, and we have a machine size to go off of: Standard_D2ds_v5)
CNS, so we can modify our Terraform (we can already do this ourselves, so we no longer need to reach out)

The general steps will be as follows:

  1. Investigate: baseline our costs, determine our current node usage, review daemonsets, and identify machine types that we can migrate to.
  2. Re-size workloads: for workloads that already specify resource requests, change the requests to better reflect actual usage, using Grafana as the source (there is no need to add requests where none are specified). This will need to be broken down per node pool.
  3. Re-size node pools: move node pools to machine sizes that better reflect our usage.
  4. Review node selection logic: are pods being scheduled where they shouldn't be?
  5. Investigate shutting down node pools: shut down machines when they are not in use.
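
As a rough illustration of steps 2 and 3, here is a minimal Python sketch of one way to turn observed usage into a suggested request and a node count. Everything in it is an assumption for illustration: the function names are hypothetical, the usage samples would really come from Grafana, and the 95th percentile, the 20% headroom, and the allocatable-per-node figure are placeholders rather than agreed values.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    ordered = sorted(samples)
    rank = math.ceil(pct / 100 * len(ordered))
    return ordered[max(rank, 1) - 1]


def suggest_request(samples, pct=95, headroom_pct=20):
    """Suggest a request: the pct-th percentile of usage plus headroom.

    The defaults are illustrative assumptions, not a team policy.
    """
    base = percentile(samples, pct)
    return math.ceil(base * (100 + headroom_pct) / 100)


def nodes_needed(total_request, allocatable_per_node):
    """Minimum node count whose combined allocatable capacity covers the requests."""
    return math.ceil(total_request / allocatable_per_node)


if __name__ == "__main__":
    # Hypothetical CPU usage samples for one workload, in millicores.
    cpu_usage_m = [40, 55, 60, 45, 50, 300, 52, 48, 47, 51]
    print("suggested CPU request (m):", suggest_request(cpu_usage_m))

    # Hypothetical totals: summed requests on a node pool, and the CPU
    # allocatable per node after system reservations (assumed value).
    print("nodes needed:", nodes_needed(3500, 1900))
```

A percentile plus headroom keeps a single spike from being ignored while still tracking typical usage; the same calculation would be repeated for memory and for each node pool.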

Issues
