Runbook: RDSExporterErrors Alert

Alert Details

  • Alert Name: RDSExporterErrors
  • Expression: increase(rds_exporter_errors_total{%(clusterSelector)s}[1h]) > 5

Description

This alert fires when the Prometheus RDS exporter has reported more than 5 errors over the last hour.
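
To make the threshold concrete, the sketch below approximates what `increase(...[1h]) > 5` checks: how much the error counter grew across the samples scraped in the last hour. It is a simplification: Prometheus's real `increase()` also extrapolates to the window boundaries, so its result can be fractional. The function names and sample values are hypothetical.

```python
def approx_increase(samples):
    """Approximate counter growth over a window of samples (oldest first).

    Simplification: sums the deltas between consecutive samples and treats
    a drop in value as a counter reset (for example, an exporter restart).
    Prometheus additionally extrapolates to the window boundaries; that
    step is omitted here.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        # After a reset the counter restarts from 0, so the new value is the growth.
        total += cur - prev if cur >= prev else cur
    return total


def would_fire(samples, threshold=5):
    # Mirrors the "> 5" comparison in the alert expression.
    return approx_increase(samples) > threshold


# Hypothetical counter values scraped during one hour.
print(would_fire([10, 10, 12, 13]))  # growth of 3, below the threshold
print(would_fire([10, 14, 2, 5]))    # growth of 4 + 2 + 3 = 9, including a reset
```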

Possible Causes

Troubleshooting Steps

1. Look at the Prometheus RDS exporter logs. Messages logged at level=ERROR should explain why AWS metrics can’t be fetched.

Common situations:

  • The exporter can’t authenticate against the AWS APIs.
  • The AWS role or AWS user used by the Prometheus RDS exporter doesn’t have the required permissions.
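
When the logs are long, filtering for error-level entries can speed up step 1. The following is a minimal sketch, assuming the exporter writes logfmt-style `key=value` lines (suggested by the `level=ERROR` marker above, but not confirmed here). The helper names and the sample lines are made up for illustration and are not actual exporter output.

```python
import re

# Matches logfmt-style key=value pairs; values may be bare or double-quoted.
PAIR = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|\S+)')


def parse_logfmt(line):
    """Return the key=value pairs of one log line as a dict."""
    return {key: value.strip('"') for key, value in PAIR.findall(line)}


def error_entries(lines):
    """Yield the parsed fields of every line logged at level=ERROR."""
    for line in lines:
        fields = parse_logfmt(line)
        if fields.get("level", "").upper() == "ERROR":
            yield fields


# Hypothetical log lines, NOT real exporter output.
sample = [
    'time=2024-01-01T00:00:00Z level=INFO msg="example startup message"',
    'time=2024-01-01T00:01:00Z level=ERROR msg="example failure" reason="example reason"',
]
for entry in error_entries(sample):
    print(entry["msg"], "|", entry.get("reason", ""))
```

In practice the lines would come from wherever the exporter's logs are collected, for example by piping them into the script and passing `sys.stdin` to `error_entries`.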

2. Depending on the error message, fix the AWS authentication or permissions, then watch rds_exporter_errors_total to confirm that the number of errors stops increasing.

The alert resolves automatically once the exporter stops generating new errors.
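
To watch the error count after a fix, the alert expression can be evaluated through the Prometheus HTTP API. The sketch below is one possible way to do that with the Python standard library. It drops the `> 5` comparison so that the current value is returned even when it is below the threshold. The helper names are hypothetical, and the Prometheus base URL and cluster selector must be supplied by the caller.

```python
import json
import urllib.parse
import urllib.request

# Alert expression from this runbook, without the "> 5" comparison.
EXPR_TEMPLATE = "increase(rds_exporter_errors_total{%(clusterSelector)s}[1h])"


def render_expression(cluster_selector):
    # The placeholder uses %(name)s syntax, so Python's % formatting can fill it.
    return EXPR_TEMPLATE % {"clusterSelector": cluster_selector}


def query_url(base_url, expression):
    # /api/v1/query is the Prometheus instant-query endpoint.
    params = urllib.parse.urlencode({"query": expression})
    return base_url.rstrip("/") + "/api/v1/query?" + params


def error_increases(payload):
    """Extract the numeric sample values from an instant-query response."""
    if payload.get("status") != "success":
        raise RuntimeError("Prometheus query failed: %r" % (payload,))
    # Each vector sample looks like {"metric": {...}, "value": [timestamp, "value"]}.
    return [float(sample["value"][1]) for sample in payload["data"]["result"]]


def current_error_increases(base_url, cluster_selector):
    """Run the query against a live Prometheus server (performs a network call)."""
    url = query_url(base_url, render_expression(cluster_selector))
    with urllib.request.urlopen(url, timeout=10) as response:
        return error_increases(json.load(response))
```

A call such as `current_error_increases("http://localhost:9090", 'cluster="example"')` uses placeholder values for both arguments. Because the expression looks back over one hour, the returned values decrease gradually after a fix rather than dropping immediately.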