If an entire StorageGRID site fails or if multiple Storage Nodes fail, you must contact technical support. Technical support will assess your situation, develop a recovery plan, and then recover the failed nodes or site in a way that meets your business objectives, optimizes recovery time, and prevents unnecessary data loss.
CAUTION:
Site recovery can only be performed by technical support.
StorageGRID systems are resilient to a wide variety of failures, and you can successfully perform many recovery and maintenance procedures yourself. However, it is difficult to create a simple, generalized site recovery procedure because the detailed steps depend on factors that are specific to your situation. For example:
Overview of site recovery
This is a general overview of the process that technical support uses to recover a failed site.
CAUTION:
Site recovery can only be performed by technical support.
- Contact technical support.
Technical support does a detailed assessment of the failure and works with you to review your business objectives. Based on this information, technical support develops a recovery plan tailored for your situation.
- Technical support recovers the primary Admin Node if it has failed.
- Technical support recovers all Storage Nodes, following this outline:
- Replace Storage Node hardware or virtual machines as required.
- Restore object metadata to the failed site.
- Restore object data to the recovered Storage Nodes.
CAUTION:
Data loss will occur if the recovery procedures for a single failed Storage Node are used.
Note: When an entire site has failed, specialized commands are required to successfully restore objects and object metadata.
- Technical support recovers other failed nodes.
After object metadata and data have been recovered, failed Gateway Nodes, non-primary Admin Nodes, or Archive Nodes can be recovered using standard procedures.