Playbook
Track p95/p99 regression and isolate DB, cache, or network causes quickly.
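As a rough illustration of what a p95/p99 regression check can look like, the sketch below compares tail latencies from a baseline window against a current window. The function names, the nearest-rank percentile method, and the 20% tolerance are assumptions made for this example, not part of the playbook.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    # p and len() are integers, so this ceiling is exact for whole-number ranks.
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def regressions(baseline, current, tolerance=0.20):
    """Return {percentile: (baseline_value, current_value)} for p95/p99 values
    that grew by more than `tolerance` (0.20 = 20%, an assumed threshold)."""
    flagged = {}
    for p in (95, 99):
        before = percentile(baseline, p)
        after = percentile(current, p)
        if after > before * (1 + tolerance):
            flagged[p] = (before, after)
    return flagged


if __name__ == "__main__":
    # Hypothetical latency samples in seconds.
    baseline = [0.1] * 95 + [0.5] * 5
    current = [0.1] * 90 + [0.9] * 10
    print(regressions(baseline, current))
```

In practice the samples would come from request logs or a metrics store, and the tolerance would be tuned per endpoint.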
Track p95/p99 by endpoint
# Prometheus example: p95 latency per route over 5-minute windows (use 0.99 for p99)
histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[5m])) by (le, route))
Trace slow requests
curl -H "x-request-id: trace-me" -w "time_total=%{time_total}\n" -o /dev/null -s https://api.example.com/health
Check network RTT
ping -c 5 api.example.com
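To separate network time from server time, the single time_total figure from the curl step can be split into phases. The sketch below is one possible approach, assuming curl is installed. It uses curl's cumulative --write-out timers (time_namelookup, time_connect, time_appconnect, time_starttransfer, time_total). The helper names and the phase labels are hypothetical, and "server_wait" is an approximation that also includes the time taken to send the request.

```python
import subprocess

# curl --write-out variables; each is cumulative seconds since the request started.
TIMERS = (
    "time_namelookup",
    "time_connect",
    "time_appconnect",
    "time_starttransfer",
    "time_total",
)
WRITE_OUT = " ".join(f"{name}=%{{{name}}}" for name in TIMERS)


def parse_timings(output):
    """Parse 'name=value name=value ...' pairs into a dict of floats."""
    return {key: float(value) for key, value in (pair.split("=", 1) for pair in output.split())}


def phases(t):
    """Convert cumulative timers into per-phase durations (seconds)."""
    # time_appconnect is 0 for plain HTTP, so fall back to the TCP connect time.
    tls_done = max(t["time_appconnect"], t["time_connect"])
    return {
        "dns": t["time_namelookup"],
        "tcp_connect": t["time_connect"] - t["time_namelookup"],
        "tls": tls_done - t["time_connect"],
        "server_wait": t["time_starttransfer"] - tls_done,
        "transfer": t["time_total"] - t["time_starttransfer"],
    }


def curl_timings(url, request_id="trace-me"):
    """Run curl once and return its cumulative timers (requires curl on PATH)."""
    result = subprocess.run(
        ["curl", "-s", "-o", "/dev/null", "-H", f"x-request-id: {request_id}", "-w", WRITE_OUT, url],
        capture_output=True, text=True, check=True,
    )
    return parse_timings(result.stdout)


if __name__ == "__main__":
    print(phases(curl_timings("https://api.example.com/health")))
```

A tcp_connect time close to the ping RTT with a large server_wait points away from the network and toward the application, database, or cache. High dns or tcp_connect values point toward the network path.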