Skip to main content

Monitor system alerts

Contributors netapp-pcarriga netapp-dbagwell amgrissino

You can monitor alerts, which are information, warnings, or errors that indicate how well the cluster is running.

Alerts are cluster faults or errors and are reported as they occur. Most errors resolve themselves automatically; however, some might require manual intervention. The system reports alert error codes with each alert on the Alerts page. Error codes help you determine what component of the system experienced the alert and why the alert was generated. See System alerts list for a descriptions and remediation steps.

After you resolve the issue, the system polls itself and identifies the issue as resolved. Then, all information about the alert including the date it was resolved is moved to the Resolved view.

  1. From the vCenter Plug-in, open the Management tab:

    • Beginning with Element vCenter plug-in 5.0, select NetApp Element Remote Plugin > Management > Management.

    • For Element vCenter plug-in 4.10 and earlier, select NetApp Element Management > Management.

  2. Select Reporting > Alerts.

  3. Monitor the following cluster alert information:

    • ID: Unique ID for a cluster alert.

    • Severity

      • warning: A minor issue that might soon require attention. System upgrades are still allowed at this severity level.

      • error: A failure that might cause performance degradation or loss of high availability (HA). Errors generally should not affect service otherwise.

      • critical: A serious failure that affects service. The system is unable to serve API or client I/O requests. Operating in this state could lead to potential loss of data.

      • bestPractice: A recommended system configuration best practice is not being used.

    • Type

      • node: Fault affecting an entire node.

      • drive: Fault affecting an individual drive.

      • cluster: Fault affecting the entire cluster.

      • service: Fault affecting a service on the cluster.

      • volume: Fault affecting a volume on the cluster.

    • Node: Node ID for the node that this fault refers to. Included for node and drive faults, otherwise set to - (dash).

    • Drive ID: Drive ID for the drive that this fault refers to. Included for drive faults, otherwise set to - (dash).

    • Error Code: A descriptive code that indicates what caused the fault.

    • Details: Detailed description of the fault.

    • Time: This heading is visible only in the Active filter view. The date and time the fault was logged.

    • Resolution Date: This heading is visible only in the Resolved filter view. The date and time the fault was resolved.

  4. To validate that the issue was resolved, look for it in the Resolved view.