Skip to main content
Install and maintain

Replace Drive - AFF C30 and AFF C60

Contributors netapp-jsnyder

You can replace a failed drive nondisruptively while I/O is in progress.

Before you begin
  • The drive that you are installing must be supported by your storage system.

  • If self-encrypting drive (SED) authentication is enabled, you must use the SED replacement instructions in the ONTAP documentation.

    Instructions in the ONTAP documentation describe additional steps you must perform before and after replacing an SED.

  • All other components in the storage system must be functioning properly, if not, you must contact NetApp Support before continuing with this procedure.

  • Verify that the drive you are removing is failed.

    You can verify that the drive is failed by running the storage disk show -broken command. The failed drive appears in the list of failed drives. If it does not, you should wait, and then run the command again.

    Note Depending on the drive type and capacity, it can take up to several hours for the drive to appear in the list of failed drives.
About this task
  • When replacing a failed drive, you must wait 70 seconds between the removal of the drive and the insertion of the replacement drive to allow the storage system to recognize that a drive was removed.

  • The best practice is to have the current version of the Disk Qualification Package (DQP) installed before hot-swapping a drive.

    Having the current version of the DQP installed allows your system to recognize and use newly qualified drives. This avoids system event messages about having noncurrent drive information and prevention of drive partitioning because drives are not recognized. The DQP also notifies you of noncurrent drive firmware.

  • The best practice is to have current versions of NVMe shelf module (NSM) firmware and drive firmware on your system before replacing FRU components.

    Note

    Do not revert firmware to a version that does not support your shelf and its components.

  • Drive firmware is automatically updated (nondisruptively) on new drives that have non-current firmware versions.

    Note Drive firmware checks occur every two minutes.
  • If needed, you can turn on the storage system location (blue) LEDs to aid in physically locating the affected storage system. Log into the BMC using SSH and enter the system location-led on command.

    A storage system has three location LEDs: one on the operator display panel and one on each controller. Location LEDs remain illuminated for 30 minutes.

    You can turn them off by entering the system location-led off command. If you are unsure if the LEDs are on or off, you can check their state by entering the system location-led show command.

Steps
  1. If you want to manually assign drive ownership for the replacement drive, you need to disable automatic drive assignment if it is enabled.

    Note You manually assign drive ownership and then reenable automatic drive assignment later in this procedure.
    1. Verify whether automatic drive assignment is enabled:

      storage disk option show

      You can enter the command on either controller.

      If automatic drive assignment is enabled, the output shows on in the Auto Assign column (for each controller).

    2. If automatic drive assignment is enabled, disable it:

      storage disk option modify -node node_name -autoassign off

      You must disable automatic drive assignment on both controllers.

  2. Properly ground yourself.

  3. Physically identify the failed drive.

    When a drive fails, the system logs a warning message to the system console indicating which drive failed. Additionally, the attention (amber) LED on the drive shelf operator display panel and the failed drive illuminate.

    Note The activity (green) LED on a failed drive can be illuminated (solid), which indicates that the drive has power, but should not be blinking, which indicates I/O activity. A failed drive has no I/O activity.
  4. Remove the failed drive:

    Remove NVMe drive
    Callout number 1

    Press the release button on the drive face to open the cam handle.

    Callout number 2

    Rotate the cam handle downward to disengage the drive from the midplane.

    Callout number 3

    Slide the drive out of the drive bay using the cam handle and supporting the drive with your other hand.

    When removing a drive, always use two hands to support its weight.

    Note Because drives are fragile, minimize handling to avoid damaging them.
  5. Wait a minimum of 70 seconds before inserting the replacement drive.

  6. Insert the replacement drive:

    1. With the cam handle in the open position, use both hands to insert the drive.

    2. Gently push until the drive stops.

    3. Close the cam handle so that the drive is fully seated into the midplane and the handle clicks into place.

      Be sure to close the cam handle slowly so that it aligns correctly with the face of the drive.

  7. Verify that the drive's activity (green) LED is illuminated.

    When the drive's activity LED is solid, it means that the drive has power. When the drive's activity LED is blinking, it means that the drive has power and I/O is in progress. If the drive firmware is automatically updating, the LED blinks.

  8. If you are replacing another drive, repeat steps 3 through step 7.

  9. If you disabled automatic drive assignment in step 1, manually assign drive ownership, and then reenable automatic drive assignment if needed:

    1. Display all unowned drives:

      storage disk show -container-type unassigned

      You can enter the command on either controller.

    2. Assign each drive:

      storage disk assign -disk disk_name -owner owner_name

      You can enter the command on either controller.

      You can use the wildcard character to assign more than one drive at once.

    3. Reenable automatic drive assignment if needed:

      storage disk option modify -node node_name -autoassign on

      You must reenable automatic drive assignment on both controllers.

  10. Return the failed part to NetApp, as described in the RMA instructions shipped with the kit.

    Contact technical support at NetApp Support, 888-463-8277 (North America), 00-800-44-638277 (Europe), or +800-800-80-800 (Asia/Pacific) if you need the RMA number or additional help with the replacement procedure.