Prometheus Alerting
Configure effective Prometheus alerting rules and Alertmanager routing for proactive infrastructure monitoring.
Usage
Describe the metrics and conditions you want to alert on, and this skill will generate the matching Prometheus alerting rules and Alertmanager configuration.
Examples
- "Alert when CPU usage exceeds 80% for 5 minutes"
- "Set up alerts for high error rates on my HTTP endpoints"
- "Create disk space warnings at 70%, 85%, and 95% thresholds"
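As a rough sketch, the first request above might translate into a rule file like the one below. The metric name node_cpu_seconds_total assumes node_exporter is being scraped; the group name, alert name, severity value, and runbook URL are placeholders chosen for illustration, not output guaranteed by this skill.

```yaml
groups:
  - name: node-cpu            # hypothetical group name
    rules:
      - alert: HighCpuUsage   # hypothetical alert name
        # Non-idle CPU percentage per instance, averaged over a 5m window.
        # Assumes node_exporter's node_cpu_seconds_total metric is available.
        expr: 100 * (1 - avg by (instance) (rate(node_cpu_seconds_total{mode="idle"}[5m]))) > 80
        # The condition must hold continuously for 5 minutes before the alert fires.
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: 'CPU usage above 80% on {{ $labels.instance }}'
          description: 'CPU usage is {{ printf "%.1f" $value }}% and has exceeded 80% for 5 minutes.'
          # Placeholder URL; point this at your own runbook.
          runbook_url: 'https://runbooks.example.com/high-cpu'
```

The same pattern extends to the tiered disk space example: one rule per threshold, each with its own severity label.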
Guidelines
- Set a `for` duration on each rule so the condition must hold continuously before the alert fires, which avoids alerting on transient spikes
- Include meaningful annotations with runbook links
- Group related alerts to reduce notification noise
- Set a severity label (for example, warning or critical) on each alert so Alertmanager can route by priority
- Validate rule files with promtool check rules, and unit-test them with promtool test rules, before deploying
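The grouping and severity guidelines above could be reflected in an Alertmanager configuration along these lines. This is a minimal sketch: the receiver names are hypothetical and have no notification integrations attached, the timing values are illustrative rather than recommended, and the matchers syntax assumes a reasonably recent Alertmanager release.

```yaml
route:
  # Fallback receiver for alerts that match no child route.
  receiver: default
  # Alerts sharing these label values are batched into one notification.
  group_by: ['alertname', 'instance']
  group_wait: 30s        # wait before sending the first notification for a new group
  group_interval: 5m     # wait before notifying about new alerts added to a group
  repeat_interval: 4h    # wait before re-sending a notification that is still firing
  routes:
    # Critical alerts go to a separate, higher-priority receiver.
    - matchers:
        - severity="critical"
      receiver: pager

receivers:
  # Hypothetical receivers; add email, Slack, PagerDuty, or webhook settings as needed.
  - name: default
  - name: pager
```

Grouping by alertname and instance is one reasonable starting point; coarser grouping (for example by cluster) reduces noise further at the cost of less specific notifications.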