INFRA-DEVOPS Contents

Federation and Regional Failover

Failover is a runbook and a test, not a diagram. Define triggers, DNS strategy, data replication, and rollback steps.

On this page

Failover Components

  • Traffic steering (DNS / GSLB)
  • Data replication (RPO/RTO targets)
  • App readiness in secondary region
  • Secrets/certs available in both regions

Runbook: Failover Drill

1) Declare drill window and freeze risky changes
2) Shift 10% traffic to secondary
3) Validate: error rate, latency, saturation
4) Promote secondary to primary (DNS/GSLB)
5) Monitor 30-60 minutes
6) Roll back if SLO burn

Failure Modes

  • DNS TTL too high → define TTL policy and test propagation.
  • Data not consistent → explicitly document RPO and acceptable loss.