Monitor networking and system resources

Contributors netapp-pcarriga

The integrity and bandwidth of the network between nodes and sites, and the resource usage by individual grid nodes, are critical to efficient operations.

Monitor network connections and performance

Network connectivity and bandwidth are especially important if your information lifecycle management (ILM) policy copies replicated objects between sites or stores erasure-coded objects using a scheme that provides site-loss protection. If the network between sites is not available, network latency is too high, or network bandwidth is insufficient, some ILM rules might not be able to place objects where expected. This can lead to ingest failures (when the Strict ingest option is selected for ILM rules), or to poor ingest performance and ILM backlogs.
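
The effect of insufficient bandwidth on ILM backlogs can be illustrated with simple arithmetic. The following is a minimal sketch, not a StorageGRID formula: the function name `replication_backlog_growth` and the `usable_fraction` parameter are hypothetical, and it assumes every ingested byte is copied once to a remote site over a single inter-site link.

```python
def replication_backlog_growth(ingest_mb_per_s: float,
                               link_mbit_per_s: float,
                               usable_fraction: float = 0.8) -> float:
    """Estimate how fast a cross-site copy backlog grows, in MB/s.

    Assumptions (illustrative only):
      - each ingested byte must be copied once across the inter-site link
      - only `usable_fraction` of the nominal link rate is available
    Returns 0.0 when the link can keep up with ingest.
    """
    if not 0 < usable_fraction <= 1:
        raise ValueError("usable_fraction must be in (0, 1]")
    # Convert megabits per second to megabytes per second, then derate.
    usable_mb_per_s = link_mbit_per_s / 8 * usable_fraction
    return max(0.0, ingest_mb_per_s - usable_mb_per_s)


# Example: 150 MB/s of ingest over a nominal 1000 Mbit/s link.
growth = replication_backlog_growth(150, 1000)
print(f"Backlog grows by about {growth:.1f} MB/s")
```

A result greater than zero suggests the link cannot keep up with sustained ingest at that rate, which is the situation in which ILM backlogs or Strict ingest failures become more likely.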

Use the Grid Manager to monitor connectivity and network performance, so you can address any issues promptly.

Additionally, consider creating network traffic classification policies so that you can monitor traffic related to specific tenants, buckets, subnets, or load balancer endpoints. You can set traffic limiting policies as needed.

Steps
  1. Select NODES.

    The Nodes page appears. Each node in the grid is listed in table format.

    [Figure: Nodes menu]
  2. Select the grid name, a specific data center site, or a grid node, and then select the Network tab.

    The Network Traffic graph summarizes overall network traffic for the entire grid, the data center site, or the node, depending on your selection.

    [Figure: Nodes page, Network Traffic graph]
    1. If you selected a grid node, scroll down to review the Network Interfaces section of the page.

      [Figure: Nodes page, Network Interfaces section]
    2. For grid nodes, scroll down to review the Network Communication section of the page.

      The Receive and Transmit tables show how many bytes and packets have been received and sent across each network, along with other receive and transmit metrics.

      [Figure: Nodes page, Network Communication section]
  3. Use the metrics associated with your traffic classification policies to monitor network traffic.

    1. Select CONFIGURATION > Network > Traffic classification.

      The Traffic Classification Policies page appears, and the existing policies are listed in the table.

      [Figure: Traffic classification policy example]
    2. To view graphs that show the networking metrics associated with a policy, select the radio button to the left of the policy, and then select Metrics.

    3. Review the graphs to understand the network traffic associated with the policy.

      If a traffic classification policy is designed to limit network traffic, analyze how often traffic is limited and decide whether the policy still meets your needs. Review each traffic classification policy periodically and adjust it as needed.
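
One way to quantify "how often traffic is limited" is to count the share of throughput samples that sit at or near the policy's limit. The following is a minimal sketch under stated assumptions: the function `fraction_limited`, the sample values, and the 5% tolerance are all hypothetical, and the samples are assumed to have been recorded by you from the policy's graphs or another source.

```python
def fraction_limited(samples, limit, tolerance=0.05):
    """Return the fraction of samples at or near the configured limit.

    `samples` and `limit` must use the same unit (for example, bytes/s).
    A sample counts as limited when it reaches (1 - tolerance) * limit.
    """
    if not samples:
        raise ValueError("at least one sample is required")
    threshold = limit * (1 - tolerance)
    limited = sum(1 for value in samples if value >= threshold)
    return limited / len(samples)


# Hypothetical throughput samples for one policy with a limit of 100.
samples = [10, 96, 100, 50]
share = fraction_limited(samples, limit=100)
print(f"{share:.0%} of samples were at or near the limit")
```

A persistently high share might indicate that the limit is too restrictive for the workload, while a share near zero might indicate that the limit is rarely reached.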

Monitor node-level resources

Monitor individual grid nodes to check their resource usage levels. If nodes are consistently overloaded, more nodes might be required for efficient operations.

Steps
  1. From the NODES page, select the node.

  2. Select the Hardware tab to display graphs of CPU Utilization and Memory Usage.

    [Figure: Nodes page, Hardware tab]
  3. To display a different time interval, select one of the controls above the chart or graph. You can display the information available for intervals of 1 hour, 1 day, 1 week, or 1 month. You can also set a custom interval, which allows you to specify date and time ranges.

  4. If the node is hosted on a storage appliance or a services appliance, scroll down to view the tables of components. The status of all components should be "Nominal." Investigate components that have any other status.
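
To make "consistently overloaded" concrete when reviewing CPU Utilization or Memory Usage over a chosen interval, you could apply a simple rule to recorded readings. The following is a minimal sketch: the function `consistently_overloaded`, the node names, the readings, and both thresholds are hypothetical illustrations, not StorageGRID defaults.

```python
def consistently_overloaded(samples, threshold_pct=80.0, min_fraction=0.9):
    """Return True if at least `min_fraction` of the utilization samples
    (percent values) exceed `threshold_pct`. Thresholds are illustrative."""
    if not samples:
        return False
    over = sum(1 for value in samples if value > threshold_pct)
    return over / len(samples) >= min_fraction


# Hypothetical CPU utilization readings (percent) per node.
nodes = {
    "node-a": [95, 92, 97, 91, 99, 93, 96, 94, 98, 90],
    "node-b": [20, 95, 30, 25, 40, 35, 22, 28, 31, 26],
}

flagged = sorted(name for name, readings in nodes.items()
                 if consistently_overloaded(readings))
print("Consistently overloaded:", flagged)
```

Requiring most samples to exceed the threshold, rather than any single sample, avoids flagging a node for a brief spike, such as the single high reading for `node-b` above.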