You perform various maintenance procedures to keep your StorageGRID system up-to-date and to ensure it is performing efficiently. The Grid Manager provides tools and options to facilitate the process of performing maintenance tasks.
When a new StorageGRID feature release is available, the Software Upgrade page guides you through the process of uploading the required file and upgrading your StorageGRID system. You must upgrade all grid nodes for all data center sites from the primary Admin Node.
During a StorageGRID software upgrade, client applications can continue to ingest and retrieve object data.
If issues with the software are detected and resolved between feature releases, you might need to apply a hotfix to your StorageGRID system.
StorageGRID hotfixes contain software changes that are made available outside of a feature or patch release. The same changes are included in a future release.
The StorageGRID Hotfix page, shown below, allows you to upload a hotfix file.
The SANtricity page, shown below, allows you to upload the SANtricity OS upgrade file.
After you upload the file, you can approve the upgrade on individual Storage Nodes or all nodes. The ability to selectively approve nodes makes it easier for you to schedule the upgrade. After you approve a node for upgrade, the system performs a health check and installs the upgrade if it is applicable to the node. The upgrade might take up to 30 minutes for each appliance-based Storage Node.
You can expand a StorageGRID system by adding storage volumes to Storage Nodes, adding new grid nodes to an existing site, or adding a new data center site. If you have Storage Nodes that use the SG6060 storage appliance, you can add one or two expansion shelves to double or triple the storage capacity of the node.
You can perform expansions without interrupting the operation of your current system. When you add nodes or a site, you first deploy the new nodes and then perform the expansion procedure from the Grid Expansion page.
Grid nodes can fail if a hardware, virtualization, operating system, or software fault renders the node inoperable or unreliable.
The steps to recover a grid node depend on the platform where the grid node is hosted and on the type of grid node. Each type of grid node has a specific recovery procedure, which you must follow exactly. Generally, you try to preserve data from the failed grid node where possible, repair or replace the failed node, use the Recovery page to configure the replacement node, and restore the node's data.
For example, this flowchart shows the recovery procedure if an Admin Node has failed.
You might want to permanently remove grid nodes or a non-functional site from your StorageGRID system.
Some maintenance procedures are specific to StorageGRID nodes that are deployed on Linux or VMware, or are specific to other components of the StorageGRID solution. For example, you might want to migrate a grid node to a different Linux host or perform maintenance on an Archive Node that is connected to Tivoli Storage Manager (TSM).
You might need to perform certain procedures on a specific grid node. For example, you might need to reboot a grid node or manually stop and restart a specific grid node service. Some grid node procedures can be performed from the Grid Manager; others require you to log in to the grid node and use the node's command line.