Version 2026.04.1 · 4 checklist steps
Use this when dashboards show elevated p95/p99 or 5xx without a clear deploy. Validate scope before paging dependent teams.
Customer-facing latency or error-rate spike on edge/API tier.
Confirm blast radius
Scope: single region vs global; guest vs auth traffic.
Correlate deploys & flags
Last 2h deploys, feature flags, autoscaling events.
Stabilize
Traffic shift, rollback, or capacity bump with owner sign-off.
Communicate
Status page + internal channel with ETA for next update.