Skip to main content

Run diagnostics

Contributors netapp-perveilerk ssantho3 netapp-lhalbert

When troubleshooting an issue, you can work with technical support to run diagnostics on your StorageGRID system and review the results.

Before you begin
  • You are signed in to the Grid Manager using a supported web browser.

  • You have specific access permissions.

About this task

The Diagnostics page performs a set of diagnostic checks on the current state of the grid. Each diagnostic check can have one of three statuses:

  • Icon Alert Green check mark Normal: All values are within the normal range.

  • Icon Alert Yellow Minor Attention: One or more of the values are outside of the normal range.

  • Icon Alert Red Critical Caution: One or more of the values are significantly outside of the normal range.

Diagnostic statuses are independent of current alerts and might not indicate operational issues with the grid. For example, a diagnostic check might show Caution status even if no alert has been triggered.

Steps
  1. Select SUPPORT > Tools > Diagnostics.

    The Diagnostics page appears and lists the results for each diagnostic check. The results are sorted by severity (Caution, Attention, and then Normal). Within each severity, the results are sorted alphabetically.

    In this example, all diagnostics have a Normal status.

    Support Diagnostics Page
  2. To learn more about a specific diagnostic, click anywhere in the row.

    Details about the diagnostic and its current results appear. The following details are listed:

    • Status: The current status of this diagnostic: Normal, Attention, or Caution.

    • Prometheus query: If used for the diagnostic, the Prometheus expression that was used to generate the status values. (A Prometheus expression is not used for all diagnostics.)

    • Thresholds: If available for the diagnostic, the system-defined thresholds for each abnormal diagnostic status. (Threshold values aren't used for all diagnostics.)

      Note You can't change these thresholds.
    • Status values: A table showing the status and the value of the diagnostic throughout the StorageGRID system. In this example, the current CPU utilization for every node in a StorageGRID system is shown. All node values are below the Attention and Caution thresholds, so the overall status of the diagnostic is Normal.

    Support Diagnostics CPU Utilization
  3. Optional: To see Grafana charts related to this diagnostic, click the Grafana dashboard link.

    This link is not displayed for all diagnostics.

    The related Grafana dashboard appears. In this example, the Node dashboard appears showing CPU Utilization over time for this node as well as other Grafana charts for the node.

    Note You can also access the pre-constructed Grafana dashboards from the Grafana section of the SUPPORT > Tools > Metrics page.
    Grafana Dashboard Nodes
  4. Optional: To see a chart of the Prometheus expression over time, click View in Prometheus.

    A Prometheus graph of the expression used in the diagnostic appears.

    Support Diagnostics Prometheus Page