Viewing a specific alert

You can view detailed information about an alert that is currently affecting your StorageGRID system or an alert that has been resolved. The details include recommended corrective actions, the time the alert was triggered, and the current value of the metrics related to this alert. Optionally, you can silence a current alert or update the alert rule.

Before you begin

Procedure

  1. Do one of the following, based on whether you want to view a current or resolved alert:
    Option Description
    Current alert
    • From the Health panel on the Dashboard, click the Current alerts link. This link appears only if at least one alert is currently active. This link is hidden if there are no current alerts or if all current alerts have been silenced.
    • Select Alerts > Current.
    • From the Nodes page, select the Overview tab for a node that has an alert icon. Then, in the Alerts section, click the alert name.
    Resolved alert
    • From the Health panel on the Dashboard, click the Recently resolved alerts link. (This link appears only if one or more alerts were triggered in the past week and are now resolved. This link is hidden if no alerts were triggered and resolved in the last week.)
    • Select Alerts > Resolved.
  2. As required, expand a group of alerts and then select the alert you want to view.
    Note: Select the alert, not the heading for a group of alerts.

    Alerts Page Select Alert
    A dialog box appears and provides details for the selected alert.
    Alerts Page Details Modal
  3. Review the alert details.
    Information Description
    title The name of the alert.
    first paragraph The description of the alert.
    Recommended actions The recommended actions for this alert.
    Time triggered The date and time the alert was triggered in your local time and in UTC.
    Time resolved For resolved alerts only, the date and time the alert was resolved in your local time and in UTC.
    Status The status of the alert: Active, Silenced, or Resolved.
    Site/Node The name of the site and node affected by the alert.
    Severity The severity of the alert.
    • Critical Icon Alert Red Critical: An abnormal condition exists that has stopped the normal operations of a StorageGRID node or service. You must address the underlying issue immediately. Service disruption and loss of data might result if the issue is not resolved.
    • Major Icon Alert Orange Major: An abnormal condition exists that is either affecting current operations or approaching the threshold for a critical alert. You should investigate major alerts and address any underlying issues to ensure that the abnormal condition does not stop the normal operation of a StorageGRID node or service.
    • Minor Icon Alert Yellow Minor: The system is operating normally, but an abnormal condition exists that could affect the system's ability to operate if it continues. You should monitor and resolve minor alerts that do not clear on their own to ensure they do not result in a more serious problem.
    data values The current value of the metric for this alert. For some alerts, additional values are shown to help you understand and investigate the alert. For example, the values shown for a Low metadata storage alert include the percent of disk space used, the total amount of disk space, and the amount of disk space used.
  4. Optionally, click silence this alert to silence the alert rule that caused this alert to be triggered.
    You must have the Manage Alerts or Root access permission to silence an alert rule.
    Attention: Be careful when deciding to silence an alert rule. If an alert rule is silenced, you might not detect an underlying problem until it prevents a critical operation from completing.
  5. To view the current conditions for the alert rule:
    1. From the alert details, click View conditions.
      A pop-up appears, listing the Prometheus expression for each defined severity.
      Alerts Page Details Modal View Condition
    2. To close the pop-up, click anywhere outside of the pop-up.
  6. Optionally, click Edit rule to edit the alert rule that caused this alert to be triggered:
    You must have the Manage Alerts or Root access permission to edit an alert rule.
    Attention: Be careful when deciding to edit an alert rule. If you change trigger values, you might not detect an underlying problem until it prevents a critical operation from completing.
  7. To close the alert details, click Close.