INFRA-DEVOPS Contents

Capacity Modeling at Scale

Capacity planning at scale requires headroom, forecasts, and failover capacity. Treat capacity as an SLO dependency.

On this page

Capacity Inputs

  • Historical usage + peak patterns
  • Growth forecasts
  • Failover load requirements
  • Platform overhead (daemonsets, logging, monitoring)

Operational Rules of Thumb

  • Keep target utilization below a defined threshold
  • Reserve capacity for incident response and failover
  • Alert on approaching saturation early

Checklist: Before a Big Launch

- Confirm headroom (CPU/mem) in all regions
- Confirm HPA/VPA policies
- Confirm autoscaler limits and quotas
- Run load test and validate SLOs
- Prepare rollback and traffic shift plan

Failure Modes

  • Autoscaler constrained by quotas → align quotas with scaling expectations.