Deploying the recovery grid node in VMware vSphere

You start the recovery of a grid node by deploying a new virtual machine using the VMware vSphere Web Client.

Before you begin

You must have downloaded the VMware installation archive, and have extracted the following files:

Place all of these files in the same directory.

CAUTION:
You must deploy the new VM using the same StorageGRID Webscale version as is currently running on the grid.

Steps

  1. Open VMware vSphere Web Client, and sign in.
  2. Navigate to the vApp or resource pool where you want to deploy the StorageGRID Webscale grid node, and select Actions > All vCenter Actions > Deploy OVF Template.
  3. Use the Deploy OVF Template dialog to deploy the .ovf file for the type of grid node you are deploying.
  4. Specify the name of the virtual machine.
    Using the host name of the failed grid node that you are replacing makes it easier to identify the virtual machine later. For example: DC3-SN1
    It is a best practice to name the VM during the initial virtual machine deployment instead of changing it later.
  5. On the Setup networks page, map each StorageGRID Webscale network to its appropriate destination network in the VMware vSphere environment:
    1. For Grid Network, select the network you will use as the Grid Network in your VMware vSphere environment.
      The Grid Network is required.
    2. If you plan to use an Admin Network, select the network you will use as the Admin Network in your VMware vSphere environment. Otherwise, map Admin Network to the destination network you selected for the Grid Network.
    3. If you plan to use a Client Network, select the network you will use as the Client Network in your VMware vSphere environment. Otherwise, map Client Network to the destination network you selected for the Grid Network.
  6. Provide the required StorageGRID Webscale information on the Customize template page.
    1. Enter the Node Name.
    2. In the Grid Network (eth0) section, enter the following information:
      • Grid network IP
      • Grid network mask
      • Grid network gateway
      • Primary Admin IP
    3. In the Admin Network (eth1) section, enter the following information:
      • Admin network IP configuration: Select Enabled or Disabled.
      • Admin network IP
      • Admin network mask
      • Admin network gateway
      • Primary Admin IP
      • Admin network external subnet list
    4. In the Client Network (eth2) section, enter the following information:
      • Client network IP configuration: Select Enabled or Disabled.
      • Client network IP
      • Client network mask
      • Client network gateway
    If you omit the primary Admin Node IP address, it is discovered automatically, provided that the primary Admin Node, or at least one other grid node with ADMIN_IP configured, is present on the same subnet.
  7. Click Finish.
  8. Click Next, ensure the Power on after deployment option is unchecked, and then click Finish to start the upload of the virtual machine.
  9. If this is not a full node recovery, perform these steps after deployment is complete:
    1. Right-click the virtual machine, and select Edit Settings.
    2. Select each default virtual hard disk that has been designated for storage, and click the Remove button located at the top of the tab.
    3. Depending on your data recovery circumstances, add new virtual disks according to your storage requirements, or reattach any virtual hard disks preserved from the previously removed failed grid node, or both.
      In general, if you are adding new disks you should use the same type of storage device that was in use prior to node recovery.
      Attention: The Storage Node OVF provided defines several VMDKs for storage. You should remove these and assign appropriate VMDKs or RDMs for storage before powering up the node. VMDKs are more commonly used in VMware environments and are easier to manage, while RDMs may provide better performance for workloads that use larger object sizes (for example, greater than 100 MB).
  10. Power on the virtual machine.
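
Before you enter values in the Deploy OVF Template dialog, it can help to check them offline. The following Python sketch is an illustration only and is not part of StorageGRID Webscale or VMware vSphere. The function names (map_networks, check_interface), the port group names, and the sample addresses are hypothetical. The first function mirrors the mapping rule in step 5, where an unused Admin Network or Client Network is mapped to the destination chosen for the Grid Network. The second checks the IP address, mask, and gateway entered for one interface in step 6, assuming a conventional layout in which the gateway sits on the same subnet as the node.

```python
import ipaddress


def map_networks(grid, admin=None, client=None):
    """Return the destination network for each StorageGRID Webscale network.

    Mirrors step 5: the Grid Network is required, and an unused Admin or
    Client Network falls back to the Grid Network destination.
    """
    if not grid:
        raise ValueError("The Grid Network is required")
    return {
        "Grid Network": grid,
        "Admin Network": admin or grid,
        "Client Network": client or grid,
    }


def check_interface(ip, mask, gateway):
    """Return a list of problems found in one interface's settings (step 6).

    Assumption: the gateway is expected to be on the same subnet as the node.
    """
    try:
        # strict=False accepts a host address combined with its netmask.
        network = ipaddress.ip_network(f"{ip}/{mask}", strict=False)
    except ValueError as exc:
        return [f"invalid IP or mask: {exc}"]
    try:
        gw = ipaddress.ip_address(gateway)
    except ValueError as exc:
        return [f"invalid gateway: {exc}"]

    problems = []
    if gw not in network:
        problems.append(f"gateway {gw} is not in {network}")
    if gw == ipaddress.ip_address(ip):
        problems.append("gateway and node IP are identical")
    return problems


if __name__ == "__main__":
    # Hypothetical values for a replacement node such as DC3-SN1.
    print(map_networks("grid-portgroup", admin="admin-portgroup"))
    print(check_interface("10.0.0.5", "255.255.255.0", "10.0.0.1"))
```

An empty list from check_interface means that no problems were detected for that interface. The check does not confirm that the address is unused or that it matches the address of the failed node.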

After you finish

To complete the recovery, return to the procedure for the failure you are addressing.

Type of recovery          Reference
Primary Admin Node        Configuring the recovery primary Admin Node
Non-primary Admin Node    Selecting Start Recovery to configure the non-primary Admin Node
API Gateway Node          Selecting Start Recovery to configure the API Gateway Node
Archive Node              Selecting Start Recovery to configure the Archive Node
Storage Node (virtual)    Selecting Start Recovery to configure the Storage Node