Recovering from a multi-controller or storage failure
If the controller failure extends to all controller modules on one side of a DR group in a MetroCluster configuration (including a single controller in a two-node MetroCluster configuration), or storage has been replaced, you must replace the equipment and reassign ownership of drives to recover from the disaster.
You should review the available recovery procedures before deciding to use this procedure.
Before you begin, confirm the following:
- The disaster site must be fenced off.
- Switchover must have been performed.
- Replacement drives and controller modules must be new and must not have been previously assigned ownership.
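The prerequisites above can be tracked as a simple pre-flight checklist. The following is a minimal sketch only: the `RecoveryPreconditions` class, its field names, and the `unmet_preconditions` helper are hypothetical names introduced for illustration and are not part of ONTAP or any NetApp tool. The values are assumed to be entered by the operator after verifying each condition manually.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RecoveryPreconditions:
    """Hypothetical operator-filled record of the prerequisites listed above."""
    disaster_site_fenced: bool
    switchover_performed: bool
    replacement_drives_unowned: bool
    replacement_controllers_new: bool


def unmet_preconditions(p: RecoveryPreconditions) -> List[str]:
    """Return a message for each prerequisite that has not been satisfied."""
    checks = [
        (p.disaster_site_fenced, "The disaster site must be fenced off."),
        (p.switchover_performed, "Switchover must have been performed."),
        (p.replacement_drives_unowned,
         "Replacement drives must be new and not previously assigned ownership."),
        (p.replacement_controllers_new,
         "Replacement controller modules must be new and not previously assigned ownership."),
    ]
    return [message for satisfied, message in checks if not satisfied]


if __name__ == "__main__":
    # Example: everything verified except switchover.
    state = RecoveryPreconditions(
        disaster_site_fenced=True,
        switchover_performed=False,
        replacement_drives_unowned=True,
        replacement_controllers_new=True,
    )
    for message in unmet_preconditions(state):
        print("NOT MET:", message)
```

An empty result means all listed prerequisites were confirmed. The sketch does not query the cluster itself; it only records what the operator has already verified.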
The examples in this procedure show two-node or four-node configurations. If you have an eight-node configuration (two DR groups), you must account for any failures in both DR groups and perform the required recovery tasks on the additional controller modules.
This procedure uses the following workflow:
This procedure can be used when performing recovery on a system that was in mid-transition when the failure occurred. In that case, you must perform the appropriate steps when preparing for switchback, as indicated in the procedure.