Hot-swap a drive - NS224 shelves

Contributors netapp-lisa netapp-martyh

You can replace a failed drive nondisruptively in an NS224 drive shelf that is powered on, and while I/O is in progress.

Before you begin
  • The drive that you are installing must be supported by the NS224 shelf.

  • If SED authentication is enabled, you must use the SED replacement instructions in the ONTAP documentation.

    Instructions in the ONTAP documentation describe additional steps you must perform before and after replacing an SED.

  • All other components in the system must be functioning properly; if not, contact technical support.

  • Verify that the drive you are removing is failed.

    You can verify that the drive is failed by running the storage disk show -broken command. The failed drive appears in the list of failed drives. If it does not, you should wait, and then run the command again.

    Note Depending on the drive type and capacity, it can take up to several hours for the drive to appear in the list of failed drives.
About this task
  • Best practice: The best practice is to have the current version of the Disk Qualification Package (DQP) installed before hot-swapping a drive.

    Having the current version of the DQP installed allows your system to recognize and use newly qualified drives. This avoids system event messages about having noncurrent drive information and prevention of drive partitioning because drives are not recognized. The DQP also notifies you of noncurrent drive firmware.

  • Best practice: The best practice is to have current versions of NVMe shelf module (NSM) firmware and drive firmware on your system before replacing FRU components.

    Note

    Do not revert firmware to a version that does not support your shelf and its components.

  • Drive firmware is automatically updated (nondisruptively) on new drives that have non current firmware versions.

    Note Drive firmware checks occur every two minutes.
  • If needed, you can turn on the shelf’s location (blue) LEDs to aid in physically locating the affected shelf: storage shelf location-led modify -shelf-name shelf_name -led-status on

    If you do not know the shelf_name of the affected shelf, run the storage shelf show command.

    A shelf has three location LEDs: one on the operator display panel and one on each NSM module. Location LEDs remain illuminated for 30 minutes. You can turn them off by entering the same command, but using the off option.

  • When you unpack the replacement drive, save all packing materials for use when you return the failed drive.

    If you need the RMA number or additional help with the replacement procedure, contact technical support at NetApp Support, 888-463-8277 (North America), 00-800-44-638277 (Europe), or +800-800-80-800 (Asia/Pacific).

  • The following video provides an overview of the physical removal and insertion portions of the drive hot-swap procedure.

Steps
  1. If you want to manually assign drive ownership for the replacement drive, you need to disable automatic drive assignment if it is enabled.

    Note You need to manually assign drive ownership if drives in the shelf are owned by both controller modules in the HA pair.
    Note You manually assign drive ownership and then reenable automatic drive assignment later in this procedure.
    1. Verify whether automatic drive assignment is enabled: storage disk option show

      You can enter the command on either controller module.

      If automatic drive assignment is enabled, the output shows on in the Auto Assign column (for each controller module).

    2. If automatic drive assignment is enabled, disable it: storage disk option modify -node node_name -autoassign off

      You must disable automatic drive assignment on both controller modules.

  2. Properly ground yourself.

  3. Physically identify the failed drive.

    When a drive fails, the system logs a warning message to the system console indicating which drive failed. Additionally, the attention (amber) LED on the drive shelf operator display panel and the failed drive illuminate.

    Note The activity (green) LED on a failed drive can be illuminated (solid), which indicates that the drive has power, but should not be blinking, which indicates I/O activity. A failed drive has no I/O activity.
  4. Remove the failed drive:

    1. Press the release button on the drive face to open the cam handle.

    2. Slide the drive out of the shelf using the cam handle and supporting the drive with your other hand.

  5. Wait a minimum of 70 seconds before inserting the replacement drive.

    This allows the system to recognize that a drive was removed.

  6. Insert the replacement drive:

    1. With the cam handle in the open position, use both hands to insert the replacement drive.

    2. Push until the drive stops.

    3. Close the cam handle so that the drive is fully seated into the mid plane and the handle clicks into place.

      Be sure to close the cam handle slowly so that it aligns correctly with the face of the drive.

  7. Verify that the drive’s activity (green) LED is illuminated.

    When the drive’s activity LED is solid, it means that the drive has power. When the drive’s activity LED is blinking, it means that the drive has power and I/O is in progress. If the drive firmware is automatically updating, the LED blinks.

  8. If you are replacing another drive, repeat steps 3 through step 7.

  9. If you disabled automatic drive assignment in step 1, manually assign drive ownership, and then reenable automatic drive assignment if needed:

    1. Display all unowned drives: storage disk show -container-type unassigned

      You can enter the command on either controller module.

    2. Assign each drive: storage disk assign -disk disk_name -owner owner_name

      You can enter the command on either controller module.

      You can use the wildcard character to assign more than one drive at once.

    3. Reenable automatic drive assignment if needed: storage disk option modify -node node_name -autoassign on

      You must reenable automatic drive assignment on both controller modules.