Alerting that is not noise
Goal: Create alerts that map to user impact and are actionable.
- Start with SLO signals (latency, errors, saturation).
- Create recording rules for expensive queries.
- Add alert labels and runbooks.
- Test alerts with controlled failures.
- Review alert volume and iterate.