Runbook: Maintenance

Alert Details

  • Alert Name: Maintenance
  • Expression: maintenance{job="health-check-proxy"} == 1

Description

This alert triggers when the exporter for the health-check-proxy detects that a host is in maintenance.

Possible Causes

  • Server is in maintenance
  • Maintenance file was not removed after maintenance

Troubleshooting Steps

  1. Check for Maintenance announcements in CloudStatus channel

  2. Remove maintenance file if certain that no reason for maintenance exists

    • Command: sudo rm /var/www/maintenance