Commands for monitoring the health of your system

You can use the system health commands to display information about the health of system resources, to respond to alerts, and to configure future alerts. Using the CLI commands enables you to view in-depth information about how health monitoring is configured. The man pages for the commands contain more information.

Displaying the status of system health

If you want to... Use this command...
Display the health status of the system, which reflects the overall status of individual health monitors system health status show
Display the health status of subsystems for which health monitoring is available system health subsystem show

Displaying the status of node connectivity

If you want to... Use this command...

Display details about connectivity from the node to the storage shelf, including port information, HBA port speed, I/O throughput, and the rate of I/O operations per second

storage shelf show -connectivity

Use the -instance parameter to display detailed information about each shelf.

Display information about drives and array LUNs, including the usable space, shelf and bay numbers, and owning node name

storage disk show

Use the -instance parameter to display detailed information about each drive.

Display detailed information about storage shelf ports, including port type, speed, and status

storage port show

Use the -instance parameter to display detailed information about each adapter.

Managing the discovery of cluster and management network switches

If you want to... Use this command...
Display the switches that the cluster monitors system cluster-switch show
Display the switches that the cluster currently monitors, including switches that you deleted (shown in the Reason column in the command output), and configuration information that you need for network access to the cluster and management network switches

This command is available at the advanced privilege level.

system cluster-switch show-all
Configure discovery of an undiscovered switch system cluster-switch create

Modify information about a switch that the cluster monitors (for example, device name, IP address, SNMP version, and community string)

system cluster-switch modify
Disable monitoring of a switch system cluster-switch modify -disable-monitoring
Display the interval in which the health monitor polls switches to gather information system cluster-switch polling-interval show
Modify the interval in which the health monitor polls switches to gather information

This command is available at the advanced privilege level.

system cluster-switch polling-interval modify
Disable discovery and monitoring of a switch and delete switch configuration information system cluster-switch delete
Permanently remove the switch configuration information which is stored in the database (doing so reenables automatic discovery of the switch) system cluster-switch delete -force
Enable automatic logging to send with AutoSupport messages

This command is available at the advanced privilege level.

system cluster-switch log

Responding to generated alerts

If you want to... Use this command...
Display information about generated alerts, such as the resource and node where the alert was triggered, and the alert's severity and probable cause system health alert show
Display information about each generated alert system health alert show -instance
Indicate that someone is working on an alert system health alert modify
Acknowledge an alert system health alert modify -acknowledge
Suppress a subsequent alert so that it does not affect the health status of a subsystem system health alert modify -suppress
Delete an alert that was not automatically cleared system health alert delete

Display information about the AutoSupport messages that alerts triggered within the last week, for example, to determine whether an alert triggered an AutoSupport message

system health autosupport trigger history show

Configuring future alerts

If you want to... Use this command...

Enable or disable the policy that controls whether a specific resource state raises a specific alert

system health policy definition modify

Displaying information about how health monitoring is configured

If you want to... Use this command...

Display information about health monitors, such as their nodes, names, subsystems, and status

system health config show
Note: Use the -instance parameter to display detailed information about each health monitor.

Display information about the alerts that a health monitor can potentially generate

system health alert definition show
Note: Use the -instance parameter to display detailed information about each alert definition.

Display information about health monitor policies, which determine when alerts are raised

system health policy definition show
Note: Use the -instance parameter to display detailed information about each policy. Use other parameters to filter the list of alerts—for example, by policy status (enabled or not), health monitor, alert, and so on.

Downgrading software versions between minor releases

When downgrading a cluster from Data ONTAP 8.2.1 to Data ONTAP 8.2, ONTAP clears the history of deleted switches. It also sets the Model column to OTHER if a switch that it automatically discovers is not supported for health monitoring.

If you want to... Use this command...
Downgrade the software version command system cluster-switch prepare-to-downgrade