Alert Runbooks

Maintenance

Runbook: Maintenance

Description

This alert triggers when the exporter for the health-check-proxy detects that a host is in maintenance.


Possible Causes


Severity estimation

This alert is not critical unless the majority of the hosts for a specific streamcloud component in a geoCluster are in maintenance, or the workload for the streamcloud components in a geoCluster is too high.
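
The host-count half of this rule can be sketched as a small shell check. This is only an illustration: the function name `majority_in_maintenance` is hypothetical, and the two counts must be supplied by the caller, since this runbook does not specify where they come from. The workload condition is not covered.

```shell
#!/bin/sh
# Hypothetical helper (not part of any existing tooling): succeeds when a
# strict majority of a component's hosts in one geoCluster are in maintenance.
majority_in_maintenance() {
    in_maintenance=$1   # hosts of the component currently in maintenance
    total=$2            # all hosts of the component in the geoCluster
    # Strict majority means more than half; an empty host list never qualifies.
    [ "$total" -gt 0 ] && [ $((in_maintenance * 2)) -gt "$total" ]
}

# Example with assumed numbers: 3 of 5 hosts in maintenance.
if majority_in_maintenance 3 5; then
    echo "critical: majority of hosts in maintenance"
else
    echo "not critical by host count"
fi
```

With exactly half of the hosts in maintenance (for example 2 of 4), the check does not report a majority.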


Troubleshooting Steps

  1. Check the maintenance dashboard

  2. Check for maintenance announcements in the CloudStatus channel

    • Action:
      • Check the CloudStatus channel in Mattermost for any deployment or maintenance announcements.

  3. Check alerts

  4. Remove the maintenance file if you are certain that no reason for maintenance exists

    • Command / Action:
      • Before removing the maintenance file, check for firing alerts or the alert history to determine the status of the host.
      • sudo rm /var/www/maintenance
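
Before running the removal, it can help to confirm that the file exists and to see who created it and when. The following is a minimal sketch, assuming the path from this runbook; the `MAINTENANCE_FILE` variable and the `inspect_maintenance_file` function are names introduced here for illustration only.

```shell
#!/bin/sh
# Sketch: inspect the maintenance file before deciding to remove it.
# MAINTENANCE_FILE defaults to the path used in this runbook; the variable
# and its override are assumptions made for this example.
MAINTENANCE_FILE="${MAINTENANCE_FILE:-/var/www/maintenance}"

inspect_maintenance_file() {
    file=$1
    if [ ! -e "$file" ]; then
        echo "no maintenance file at $file"
        return 1
    fi
    # Owner and modification time hint at who set maintenance and when.
    ls -l "$file"
    return 0
}

if inspect_maintenance_file "$MAINTENANCE_FILE"; then
    # The removal itself stays a manual step, run only after the alert
    # and announcement checks above show no reason for maintenance.
    echo "if no reason for maintenance remains: sudo rm $MAINTENANCE_FILE"
fi
```

The sketch deliberately does not delete anything, so the decision to remove the file remains with the operator.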

Additional resources

Grafana dashboards: