Recovery procedures

When you recover a failed grid node, you must replace the failed physical or virtual server, reinstall StorageGRID Webscale software, and ensure all recoverable data is intact. You must follow the specific recovery procedure for the type of grid node that failed.

Grid nodes fail if a hardware, virtualization, operating system, or software fault renders the node inoperable or unreliable. Many kinds of failure can require you to recover a grid node. The recovery procedure depends on the platform that hosts the grid node and on the type of grid node that you need to recover.

Generally, you try to preserve data from the failed grid node where possible, repair or replace the failed host, and then restore the StorageGRID Webscale node to the host. After grid node software is running, you restore data and connections for the restored node so that it can function properly in the grid.

If a server hosts more than one grid node, you can recover the nodes in any order. However, if the failed server hosts a primary Admin Node, recover that node first. Doing so prevents delays caused by the other nodes attempting to contact the primary Admin Node during their own recovery.
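
The ordering rule for a server that hosts several grid nodes can be illustrated with a short planning sketch. This is not part of StorageGRID Webscale or any of its tools. The FailedNode class, the node-type labels, the node names, and the recovery_order function are all hypothetical, chosen only to show the rule: the primary Admin Node goes first, and the remaining nodes keep whatever order they were listed in.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FailedNode:
    """A grid node awaiting recovery (hypothetical planning record)."""
    name: str
    node_type: str  # hypothetical labels, e.g. "primary_admin", "storage"


def recovery_order(nodes):
    """Return the nodes with any primary Admin Node moved to the front.

    sorted() is stable, so all other nodes keep their original relative
    order, reflecting that they can be recovered in any order.
    """
    # False sorts before True, so the primary Admin Node comes first.
    return sorted(nodes, key=lambda n: n.node_type != "primary_admin")


# Hypothetical nodes hosted on one failed server.
failed = [
    FailedNode("dc1-s1", "storage"),
    FailedNode("dc1-adm1", "primary_admin"),
    FailedNode("dc1-gw1", "api_gateway"),
]

for node in recovery_order(failed):
    print(node.name, node.node_type)
```

With the sample list above, dc1-adm1 is printed first, followed by dc1-s1 and dc1-gw1 in their original order.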