Learn about available ONTAP cluster health monitors
Several health monitors cover different parts of a cluster. They help you recover from errors within ONTAP systems by detecting events, sending alerts to you, and deleting events as they clear.
Health monitor name (identifier) | Subsystem name (identifier) | Purpose |
---|---|---|
Ethernet switch | Switch (Switch-Health) | The ONTAP Ethernet Switch Health Monitor (CSHM) monitors the status of cluster and storage network switches while collecting logs for analysis. By default, CSHM polls each switch via SNMPv2c every 5 minutes to update resource tables with information on supportability, monitoring status, temperature sensors, CPU utilization, interface configurations and connections, cluster switch redundancy, and fan and power supply operations. Additionally, if configured, CSHM collects logs via SSH/SCP every hour, which are sent through AutoSupport for further analysis. Upon request, CSHM can also perform a more detailed tech-support log collection using SSH/SCP. See Switch health monitoring for further details. |
MetroCluster Fabric | Switch | Monitors the MetroCluster configuration back-end fabric topology and detects misconfigurations such as incorrect cabling and zoning, and ISL failures. |
MetroCluster Health | Interconnect, RAID, and storage | Monitors FC-VI adapters, FC initiator adapters, left-behind aggregates and disks, and inter-cluster ports. |
Node connectivity (node-connect) | CIFS nondisruptive operations (CIFS-NDO) | Monitors SMB connections for nondisruptive operations to Hyper-V applications. |
Node connectivity (node-connect) | Storage (SAS-connect) | Monitors shelves, disks, and adapters at the node level for appropriate paths and connections. |
System | not applicable | Aggregates information from other health monitors. |
System connectivity (system-connect) | Storage (SAS-connect) | Monitors shelves at the cluster level for appropriate paths to two HA clustered nodes. |
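
The monitor-to-subsystem mapping in the table above can be sketched as a small lookup structure, which might be handy when deciding which monitor to check for a given subsystem. This is an illustrative sketch only: `Monitor`, `MONITORS`, and `monitors_for_subsystem` are hypothetical names, not part of any ONTAP API, and the data simply restates the table (identifiers are `None` where the table lists none).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Monitor:
    """One row group from the health monitor table (hypothetical type)."""
    name: str
    identifier: Optional[str]      # None where the table lists no identifier
    subsystems: Tuple[str, ...]    # empty where the table says "not applicable"


# Data transcribed from the table; nothing here is queried from a live cluster.
MONITORS = (
    Monitor("Ethernet switch", None, ("Switch (Switch-Health)",)),
    Monitor("MetroCluster Fabric", None, ("Switch",)),
    Monitor("MetroCluster Health", None, ("Interconnect, RAID, and storage",)),
    Monitor(
        "Node connectivity",
        "node-connect",
        ("CIFS nondisruptive operations (CIFS-NDO)", "Storage (SAS-connect)"),
    ),
    Monitor("System", None, ()),
    Monitor("System connectivity", "system-connect", ("Storage (SAS-connect)",)),
)


def monitors_for_subsystem(keyword: str) -> List[str]:
    """Return names of monitors with a subsystem matching keyword (case-insensitive)."""
    needle = keyword.lower()
    return [
        m.name
        for m in MONITORS
        if any(needle in s.lower() for s in m.subsystems)
    ]


if __name__ == "__main__":
    # Both connectivity monitors watch the SAS-connect storage subsystem.
    print(monitors_for_subsystem("SAS-connect"))
```

The System monitor has an empty subsystem tuple because it only aggregates information from the other monitors, so it never appears in a subsystem lookup.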