Removing failed drives from the cluster

Before replacing the failed drive, you must remove it from the cluster. You must get details, such as ID and serial number of the node that the failed drive is part of, and slot number of the failed drive.

About this task

The instructions apply if you have H610S nodes as part of your NetApp HCI installation or a SolidFire all-flash storage installation. Drives in the Alerts list in the NetApp Element software user interface (UI) show as blockServiceUnhealthy when a node is offline. When restarting the node, if the node and its drives come back online within five and a half minutes, the drives automatically update and continue as active drives in the cluster.

You can use the Element UI or the Element Plug-in for vCenter to remove the failed drive from the cluster. When you remove a drive, the system migrates the data on the drive to other available drives in the system. The duration of data migration depends on the amount of data that is being migrated.

Steps

  1. If you are using the Element UI, perform the following steps:
    1. Select Cluster > Drives .
    2. Click Failed to view the list of failed drives.
    3. Note the slot number of the failed drive. You need this information to locate the failed drive in the chassis.
    4. Click Actions for the drive you want to remove.
    5. Click Remove.
    6. Proceed to the next task to physically remove the failed drive.
  2. If you are using the Element Plug-in for vCenter, perform the following steps:
    1. From the Element Plug-in for vCenter, copy the management virtual IP (MVIP) address from the UI.

      In this screenshot, the MVIP IP address is highlighted.
      In this screenshot, the MVIP IP address is highlighted.

    2. Open a web browser, and enter the MVIP address to access the Element UI.
    3. Log in to the Element UI using the user name and password that you configured in the NetApp Deployment Engine.
    4. If the drive has already failed, perform the following steps:
      1. Click Cluster > Drives > Failed . Record the node ID and slot number associated with the failed drive. You need this information to locate the failed drive in your chassis.
      2. Click Nodes. Record the serial number (service tag) for the node ID that you noted down in the previous step.
      3. Click Drives > Failed .
      4. Select the failed drive, and click Actions > Remove .
      5. Confirm the action.
    5. If the drive is active, but must be replaced because of wear, perform the following steps:
      1. Click Cluster > Drives > Active . Record the node ID and slot number associated with the drive that needs to be replaced. You need this information to locate the drive in your chassis.
      2. Click Nodes. Record the serial number (service tag) for the node ID that you noted down in the previous step.
      3. Click Drives > Active .
      4. Select the drive that needs to be replaced, and click Actions > Remove .
      5. Confirm the action.
    6. Proceed to the next task to physically remove the failed drive.