Replace a DIMM - AFF A250
You must replace a DIMM in the controller when your storage system encounters errors such as, excessive CECC (Correctable Error Correction Codes) errors that are based on Health Monitor alerts or uncorrectable ECC errors, typically caused by a single DIMM failure preventing the storage system from booting ONTAP.
All other components in the system must be functioning properly; if not, you must contact technical support.
You must replace the failed component with a replacement FRU component you received from your provider.
Step 1: Shut down the impaired controller
To shut down the impaired controller, you must determine the status of the controller and, if necessary, take over the controller so that the healthy controller continues to serve data from the impaired controller storage.
-
If you have a SAN system, you must have checked event messages (
cluster kernel-service show
) for the impaired controller SCSI blade. Thecluster kernel-service show
command (from priv advanced mode) displays the node name, quorum status of that node, availability status of that node, and operational status of that node.Each SCSI-blade process should be in quorum with the other nodes in the cluster. Any issues must be resolved before you proceed with the replacement.
-
If you have a cluster with more than two nodes, it must be in quorum. If the cluster is not in quorum or a healthy controller shows false for eligibility and health, you must correct the issue before shutting down the impaired controller; see Synchronize a node with the cluster.
-
If AutoSupport is enabled, suppress automatic case creation by invoking an AutoSupport message:
system node autosupport invoke -node * -type all -message MAINT=<# of hours>h
The following AutoSupport message suppresses automatic case creation for two hours:
cluster1:> system node autosupport invoke -node * -type all -message MAINT=2h
-
Disable automatic giveback from the console of the healthy controller:
storage failover modify –node local -auto-giveback false
When you see Do you want to disable auto-giveback?, enter y
. -
Take the impaired controller to the LOADER prompt:
If the impaired controller is displaying… Then… The LOADER prompt
Go to the next step.
Waiting for giveback…
Press Ctrl-C, and then respond
y
when prompted.System prompt or password prompt
Take over or halt the impaired controller from the healthy controller:
storage failover takeover -ofnode impaired_node_name
When the impaired controller shows Waiting for giveback…, press Ctrl-C, and then respond
y
.
Step 2: Remove the controller module
You must remove the controller module from the chassis when you replace a component inside the controller module.
Make sure that you label the cables so that you know where they came from.
-
If you are not already grounded, properly ground yourself.
-
Unplug the controller module power supplies from the source.
-
Release the power cable retainers, and then unplug the cables from the power supplies.
-
Insert your forefinger into the latching mechanism on either side of the controller module, press the lever with your thumb, and gently pull the controller a few inches out of the chassis.
If you have difficulty removing the controller module, place your index fingers through the finger holes from the inside (by crossing your arms). Lever
Latching mechanism
-
Using both hands, grasp the controller module sides and gently pull it out of the chassis and set it on a flat, stable surface.
-
Turn the thumbscrew on the front of the controller module anti-clockwise and open the controller module cover.
Thumbscrew
Controller module cover.
-
Lift out the air duct cover.
Step 3: Replace a DIMM
To replace a DIMM, you must locate it in the controller module using the DIMM map label on top of the air duct and then replace it following the specific sequence of steps.
Use the following video or the tabulated steps to replace a DIMM:
-
Replace the impaired DIMM on your controller module.
The DIMMs are in slot 3 or 1 on the motherboard. Slot 2 and 4 are left empty. Do not attempt to install DIMMs into these slots.
-
Note the orientation of the DIMM in the socket so that you can insert the replacement DIMM in the proper orientation.
-
Slowly push apart the DIMM ejector tabs on either side of the DIMM, and slide the DIMM out of the slot.
-
Leave DIMM ejector tabs on the connector in the open position.
-
Remove the replacement DIMM from the antistatic shipping bag, hold the DIMM by the corners, and align it to the slot.
Hold the DIMM by the edges to avoid pressure on the components on the DIMM circuit board. -
Insert the replacement DIMM squarely into the slot.
The DIMMs fit tightly in the socket. If not, reinsert the DIMM to realign it with the socket.
-
Visually inspect the DIMM to verify that it is evenly aligned and fully inserted into the socket.
Step 4: Install the controller module
After you have replaced the component in the controller module, you must reinstall the controller module into the chassis, and then boot it.
You can use the following illustrations or the written steps to install the replacement controller module in the chassis.
-
If you have not already done so, install the air duct.
-
Close the controller module cover and tighten the thumbscrew.
Controller module cover
Thumbscrew
-
Insert the controller module into the chassis:
-
Ensure the latching mechanism arms are locked in the fully extended position.
-
Using both hands, align and gently slide the controller module into the latching mechanism arms until it stops.
-
Place your index fingers through the finger holes from the inside of the latching mechanism.
-
Press your thumbs down on the orange tabs on top of the latching mechanism and gently push the controller module over the stop.
-
Release your thumbs from the top of the latching mechanisms and continue pushing until the latching mechanisms snap into place.
The controller module begins to boot as soon as it is fully seated in the chassis.
The controller module should be fully inserted and flush with the edges of the chassis.
-
-
Cable the management and console ports only, so that you can access the system to perform the tasks in the following sections.
You will connect the rest of the cables to the controller module later in this procedure.
Step 5: Return the failed part to NetApp
Return the failed part to NetApp, as described in the RMA instructions shipped with the kit. See the Part Return and Replacements page for further information.