Information you should monitor regularly

11/22/2022 Contributors

StorageGRID is a fault-tolerant, distributed storage system that is designed to continue operating even when errors occur, or when nodes or sites are unavailable. You must proactively monitor system health, workloads, and usage statistics so that you can take action to address potential issues before they affect the grid's efficiency or availability.

A busy system generates large amounts of information. This section provides guidance about the most important information to monitor on an ongoing basis. This section contains the following sub-sections:

What to monitor	Frequency
The system health data shown on the Grid Manager DashboardNote if anything has changed from the previous day.	Daily
Rate at which Storage Node object and metadata capacity is being consumed	Weekly
Information lifecycle management operations	Weekly
Performance, networking, and system resources: Query latency Connectivity and networking Node-level resources	Weekly
Tenant activity	Weekly
Capacity of the external archival storage system	Weekly
Load balancing operations	After the initial configuration and after any configuration changes
Availability of software hotfixes and software upgrades	Monthly

What to monitor

Frequency

The system health data shown on the Grid Manager DashboardNote if anything has changed from the previous day.

Daily

Rate at which Storage Node object and metadata capacity is being consumed

Weekly

Information lifecycle management operations

Weekly

Performance, networking, and system resources:

Query latency
Connectivity and networking
Node-level resources

Weekly

Tenant activity

Weekly

Capacity of the external archival storage system

Weekly

Load balancing operations

After the initial configuration and after any configuration changes

Availability of software hotfixes and software upgrades

Monthly

Information you should monitor regularly

Creating your file...