Information you should monitor regularly
StorageGRID is a fault-tolerant, distributed storage system that is designed to continue operating even when errors occur, or when nodes or sites are unavailable. You must proactively monitor system health, workloads, and usage statistics so that you can take action to address potential issues before they affect the grid's efficiency or availability.
A busy system generates large amounts of information. This section provides guidance about the most important information to monitor on an ongoing basis. This section contains the following sub-sections:
What to monitor | Frequency |
---|---|
The system health data shown on the Grid Manager DashboardNote if anything has changed from the previous day. |
Daily |
Rate at which Storage Node object and metadata capacity is being consumed |
Weekly |
Information lifecycle management operations |
Weekly |
Performance, networking, and system resources:
|
Weekly |
Tenant activity |
Weekly |
Capacity of the external archival storage system |
Weekly |
Load balancing operations |
After the initial configuration and after any configuration changes |
Availability of software hotfixes and software upgrades |
Monthly |