Replace a DIMM - FAS2820
You must replace a DIMM in the controller when your storage system encounters errors such as, excessive CECC (Correctable Error Correction Codes) errors that are based on Health Monitor alerts or uncorrectable ECC errors, typically caused by a single DIMM failure preventing the storage system from booting ONTAP.
All other components in the system must be functioning properly; if not, you must contact technical support.
You must replace the failed component with a replacement FRU component you received from your provider.
Step 1: Shut down the impaired controller
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take over the controller so that the healthy controller continues to serve data from the impaired controller storage.
If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a healthy controller shows false for eligibility and health, you must correct the issue before shutting down the impaired controller; see Synchronize a node with the cluster.
-
If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=_number_of_hours_down_h
The following AutoSupport message suppresses automatic case creation for two hours:
cluster1:*> system node autosupport invoke -node * -type all -message MAINT=2h
-
If the impaired controller is part of an HA pair, disable automatic giveback from the console of the healthy controller:
storage failover modify -node local -auto-giveback false
-
Take the impaired controller to the LOADER prompt:
If the impaired controller is displaying… Then… The LOADER prompt
Go to Remove controller module.
Waiting for giveback…
Press Ctrl-C, and then respond
y
.System prompt or password prompt (enter system password)
Take over or halt the impaired controller from the healthy controller:
storage failover takeover -ofnode impaired_node_name
When the impaired controller shows Waiting for giveback…, press Ctrl-C, and then respond
y
.
Step 2: Remove controller module
Remove the controller module from the system and then remove the controller module cover.
-
If you are not already grounded, properly ground yourself.
-
Loosen the hook and loop strap binding the cables to the cable management device, and then unplug the system cables and SFPs (if needed) from the controller module, keeping track of where the cables were connected.
Leave the cables in the cable management device so that when you reinstall the cable management device, the cables are organized.
-
Remove and set aside the cable management devices from the left and right sides of the controller module.
-
Squeeze the latch on the cam handle until it releases, open the cam handle fully to release the controller module from the midplane, and then, using two hands, pull the controller module out of the chassis.
-
Turn the controller module over and place it on a flat, stable surface.
-
Open the cover by pressing the blue buttons on the sides of the controller module to release the cover, and then rotate the cover up and off of the controller module.
Controller module cover release button |
Step 3: Replace the DIMMs
Locate the DIMM inside the controller, remove it, and replace it.
Before replacing a DIMM, you need to unplug the NVMEM battery from the controller module. |
-
If you are not already grounded, properly ground yourself.
You must perform a clean system shutdown before replacing system components to avoid losing unwritten data in the nonvolatile memory (NVMEM). The LED is located on the back of the controller module. Look for the following icon:
-
If the NVMEM LED is not flashing, there is no content in the NVMEM; you can skip the following steps and proceed to the next task in this procedure.
-
If the NVMEM LED is flashing, there is data in the NVMEM and you must disconnect the battery to clear the memory:
-
Remove the battery from the controller module by pressing the blue button on the side of the controller module.
-
Slide the battery up until it clears the holding brackets, and then lift the battery out of the controller module.
-
Locate the battery cable, press the clip on the battery plug to release the lock clip from the plug socket, and then unplug the battery cable from the socket.
-
Confirm that the NVMEM LED is no longer lit.
-
Reconnect the battery connector and recheck the LED on the back of the controller.
-
Unplug the battery cable.
-
-
Locate the DIMMs on your controller module.
-
Note the orientation and location of the DIMM in the socket so that you can insert the replacement DIMM in the proper orientation.
-
Eject the DIMM from its slot by slowly pushing apart the two DIMM ejector tabs on either side of the DIMM, and then slide the DIMM out of the slot.
The DIMM will rotate up a little.
-
Rotate the DIMM as far as it will go, and then slide the DIMM out of the socket.
Carefully hold the DIMM by the edges to avoid pressure on the components on the DIMM circuit board. NVRAM battery release button
NVRAM battery plug
DIMM ejector tabs
DIMMs
-
Remove the replacement DIMM from the antistatic shipping bag, hold the DIMM by the corners, and align it to the slot.
The notch among the pins on the DIMM should line up with the tab in the socket.
-
Insert the DIMM squarely into the slot.
The DIMM fits tightly in the slot, but should go in easily. If not, realign the DIMM with the slot and reinsert it.
Visually inspect the DIMM to verify that it is evenly aligned and fully inserted into the slot. -
Push carefully, but firmly, on the top edge of the DIMM until the ejector tabs snap into place over the notches at the ends of the DIMM.
-
Reconnect the NVMRM battery:
-
Plug in the NVRAM battery.
Make sure that the plug locks down into the battery power socket on the motherboard.
-
Align the battery with the holding brackets on the sheet metal side wall.
-
Slide the battery pack down until the battery latch engages and clicks into the opening on the side wall.
-
-
Reinstall the controller module cover.
Step 4: Reinstall the controller module
Reinstall the controller module into the chassis.
-
If you are not already grounded, properly ground yourself.
-
If you have not already done so, replace the cover on the controller module.
-
Turn the controller module over and align the end with the opening in the chassis.
-
Gently push the controller module halfway into the system.Align the end of the controller module with the opening in the chassis, and then gently push the controller module halfway into the system.
Do not completely insert the controller module in the chassis until instructed to do so. -
Recable the system, as needed.
If you removed the media converters (QSFPs or SFPs), remember to reinstall them if you are using fiber optic cables.
-
Complete the reinstallation of the controller module:
-
With the cam handle in the open position, firmly push the controller module in until it meets the midplane and is fully seated, and then close the cam handle to the locked position.
Do not use excessive force when sliding the controller module into the chassis to avoid damaging the connectors. The controller begins to boot as soon as it is seated in the chassis.
-
If you have not already done so, reinstall the cable management device.
-
Bind the cables to the cable management device with the hook and loop strap.
-
-
Reboot the controller module.
During the boot process, you might see the following prompts: -
A prompt warning of a system ID mismatch and asking to override the system ID.
-
A prompt warning that when entering Maintenance mode in an HA configuration you must ensure that the healthy controller remains down. You can safely respond
y
to these prompts.
-
Step 5: Restore automatic giveback and AutSupport
Restore automatic giveback and AutoSupport if they have been disabled.
-
Restore automatic giveback by using the
storage failover modify -node local -auto-giveback true
command. -
If an AutoSupport maintenance window was triggered, end it by using the
system node autosupport invoke -node * -type all -message MAINT=END
command.
Step 6: Return the failed part to NetApp
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return and Replacements page for further information.