PDF of this doc site
- Get started
Install and maintain appliance hardware
SG100 and SG1000 services appliances
- Prepare for installation (SG100 and SG1000)
SG6000 storage appliances
- Prepare for installation (SG6000)
- Configure hardware (SG6000)
SG5700 storage appliances
- Prepare for installation (SG5700)
- Configure hardware (SG5700)
SG5600 storage appliances
- Prepare for installation (SG5600)
- Configure hardware (SG5600)
- SG100 and SG1000 services appliances
Install and upgrade software
- Upgrade StorageGRID software
- Install Red Hat Enterprise Linux or CentOS
- Install Ubuntu or Debian
Perform system administration
- Manage security settings
- Manage Admin Nodes
- Manage Archive Nodes
Manage objects with ILM
- ILM and object lifecycle
- Create storage grades, storage pools, EC profiles, and regions
- Administer StorageGRID
- Use a tenant account
- S3 REST API supported operations and limitations
Monitor and maintain StorageGRID
Monitor and troubleshoot
- Troubleshoot a StorageGRID system
- Expand your grid
Recover and maintain
Grid node recovery procedures
- Recover from Storage Node failures
- Recover from Admin Node failures
- All grid node types: Replace Linux node
- Grid node decommission
- Network maintenance procedures
- Grid node procedures
- Grid node recovery procedures
Review audit logs
- Audit messages and the object lifecycle
- Monitor and troubleshoot
When troubleshooting an issue, you can work with technical support to run diagnostics on your StorageGRID system and review the results.
You are signed in to the Grid Manager using a supported web browser.
You have specific access permissions.
The Diagnostics page performs a set of diagnostic checks on the current state of the grid. Each diagnostic check can have one of three statuses:
Normal: All values are within the normal range.
Attention: One or more of the values are outside of the normal range.
Caution: One or more of the values are significantly outside of the normal range.
Diagnostic statuses are independent of current alerts and might not indicate operational issues with the grid. For example, a diagnostic check might show Caution status even if no alert has been triggered.
Select SUPPORT > Tools > Diagnostics.
The Diagnostics page appears and lists the results for each diagnostic check. The results are sorted by severity (Caution, Attention, and then Normal). Within each severity, the results are sorted alphabetically.
In this example, all diagnostics have a Normal status.
To learn more about a specific diagnostic, click anywhere in the row.
Details about the diagnostic and its current results appear. The following details are listed:
Status: The current status of this diagnostic: Normal, Attention, or Caution.
Prometheus query: If used for the diagnostic, the Prometheus expression that was used to generate the status values. (A Prometheus expression is not used for all diagnostics.)
Thresholds: If available for the diagnostic, the system-defined thresholds for each abnormal diagnostic status. (Threshold values are not used for all diagnostics.)
You cannot change these thresholds.
Status values: A table showing the status and the value of the diagnostic throughout the StorageGRID system. In this example, the current CPU utilization for every node in a StorageGRID system is shown. All node values are below the Attention and Caution thresholds, so the overall status of the diagnostic is Normal.
Optional: To see Grafana charts related to this diagnostic, click the Grafana dashboard link.
This link is not displayed for all diagnostics.
The related Grafana dashboard appears. In this example, the Node dashboard appears showing CPU Utilization over time for this node as well as other Grafana charts for the node.
You can also access the pre-constructed Grafana dashboards from the Grafana section of the SUPPORT > Tools > Metrics page.
Optional: To see a chart of the Prometheus expression over time, click View in Prometheus.
A Prometheus graph of the expression used in the diagnostic appears.