INFRA-DEVOPS Contents

Dashboards That Drive Action

Create dashboards that drive decisions: golden paths, drill-downs, and runbook links for fast response.

On this page

Dashboards That Operators Use

  • Start with a "service overview": traffic, errors, latency, saturation.
  • Provide drill-down links: logs query, traces filter, runbook.
  • Show deploy markers and config changes.

Core Panels

  • RPS and error rate (stacked by status class)
  • Latency percentiles (p50/p95/p99)
  • Dependency latency/errors
  • Resource saturation (CPU/memory/disk/net)

Checklist

  1. Can I answer "is the service healthy" in 10 seconds?
  2. Can I find "what changed" quickly?
  3. Can I pivot to logs/traces in one click?

Failure Modes

  • Pretty dashboards with no decisions: no thresholds, no runbooks.
  • Too many panels: slow load and cognitive overload.