Commands for monitoring the health of your system

Contributors

You can use the system health commands to display information about the health of system resources, to respond to alerts, and to configure future alerts. Using the CLI commands enables you to view in-depth information about how health monitoring is configured. The man pages for the commands contain more information.

Display the status of system health

If you want to…​ Use this command…​

Display the health status of the system, which reflects the overall status of individual health monitors

system health status show

Display the health status of subsystems for which health monitoring is available

system health subsystem show

Display the status of node connectivity

If you want to…​ Use this command…​

Display details about connectivity from the node to the storage shelf, including port information, HBA port speed, I/O throughput, and the rate of I/O operations per second

storage shelf show -connectivity

Use the -instance parameter to display detailed information about each shelf.

Display information about drives and array LUNs, including the usable space, shelf and bay numbers, and owning node name

storage disk show

Use the -instance parameter to display detailed information about each drive.

Display detailed information about storage shelf ports, including port type, speed, and status

storage port show

Use the -instance parameter to display detailed information about each adapter.

Manage the discovery of cluster and management network switches

If you want to…​ Use this command…​

Display the switches that the cluster monitors

system cluster-switch show

Display the switches that the cluster currently monitors, including switches that you deleted (shown in the Reason column in the command output), and configuration information that you need for network access to the cluster and management network switches.

This command is available at the advanced privilege level.

system cluster-switch show-all

Configure discovery of an undiscovered switch

system cluster-switch create

Modify information about a switch that the cluster monitors (for example, device name, IP address, SNMP version, and community string)

system cluster-switch modify

Disable monitoring of a switch

system cluster-switch modify -disable-monitoring

Display the interval in which the health monitor polls switches to gather information

system cluster-switch polling-interval show

Modify the interval in which the health monitor polls switches to gather information.

This command is available at the advanced privilege level.

system cluster-switch polling-interval modify

Disable discovery and monitoring of a switch and delete switch configuration information

system cluster-switch delete

Permanently remove the switch configuration information which is stored in the database (doing so reenables automatic discovery of the switch)

system cluster-switch delete -force

Enable automatic logging to send with AutoSupport messages.

This command is available at the advanced privilege level.

system cluster-switch log

Respond to generated alerts

If you want to…​ Use this command…​

Display information about generated alerts, such as the resource and node where the alert was triggered, and the alert’s severity and probable cause

system health alert show

Display information about each generated alert

system health alert show -instance

Indicate that someone is working on an alert

system health alert modify

Acknowledge an alert

system health alert modify -acknowledge

Suppress a subsequent alert so that it does not affect the health status of a subsystem

system health alert modify -suppress

Delete an alert that was not automatically cleared

system health alert delete

Display information about the AutoSupport messages that alerts triggered within the last week, for example, to determine whether an alert triggered an AutoSupport message

system health autosupport trigger history show

Configure future alerts

If you want to…​ Use this command…​

Enable or disable the policy that controls whether a specific resource state raises a specific alert

system health policy definition modify

Display information about how health monitoring is configured

If you want to…​ Use this command…​

Display information about health monitors, such as their nodes, names, subsystems, and status

system health config show

Note

Use the -instance parameter to display detailed information about each health monitor.

Display information about the alerts that a health monitor can potentially generate

system health alert definition show

Note

Use the -instance parameter to display detailed information about each alert definition.

Display information about health monitor policies, which determine when alerts are raised

system health policy definition show

Note

Use the -instance parameter to display detailed information about each policy. Use other parameters to filter the list of alerts—​for example, by policy status (enabled or not), health monitor, alert, and so on.

Downgrade software versions between minor releases

When downgrading a cluster from Data ONTAP 8.2.1 to Data ONTAP 8.2, ONTAP clears the history of deleted switches. It also sets the Model column to OTHER if a switch that it automatically discovers is not supported for health monitoring.

If you want to…​ Use this command…​

Downgrade the software version command

system cluster-switch prepare-to-downgrade