Upgrading controllers in a MetroCluster FC configuration using switchover and switchback
You can use the MetroCluster switchover operation to provide nondisruptive service to clients while the controller modules on the partner cluster are upgraded. Other components (such as storage shelves or switches) cannot be upgraded as part of this procedure.
Supported platform combinations
You can upgrade certain platforms using the switchover and switchback operation in a MetroCluster FC configuration.
For information on which platform upgrade combinations are supported, review the MetroCluster FC upgrade table in Choose a controller upgrade procedure.
Refer to Choosing an upgrade or refresh method for additional procedures.
About this task
-
You can use this procedure only for controller upgrade.
Other components in the configuration, such as storage shelves or switches, cannot be upgraded at the same time.
-
You can use this procedure with certain ONTAP versions:
-
Two-node configurations are supported in ONTAP 9.3 and later.
-
Four- and eight-node configurations are supported in ONTAP 9.8 and later.
Do not use this procedure on four- or eight-node configurations running ONTAP versions prior to 9.8.
-
-
Your original and new platforms must be compatible and supported.
If the original or new platforms are FAS8020 or AFF8020 systems using ports 1c and 1d in FC-VI mode, see the Knowledge Base article Upgrading controllers when FCVI connections on existing FAS8020 or AFF8020 nodes use ports 1c and 1d.
-
The licenses at both sites must match. You can obtain new licenses from NetApp Support.
-
This procedure applies to controller modules in a MetroCluster FC configuration (a two-node stretch MetroCluster configuration, or a two-node, four-node, or eight-node fabric-attached MetroCluster configuration).
-
All controllers in the same DR group should be upgraded during the same maintenance period.
Operating the MetroCluster configuration with different controller types in the same DR group is not supported outside of this maintenance activity. For eight-node MetroCluster configurations, the controllers within a DR group must be the same, but the two DR groups can use different controller types.
-
Mapping the storage, FC, and Ethernet connections between the original nodes and the new nodes in advance is recommended.
-
If the new platform has fewer slots than the original system, or if it has fewer or different types of ports, you might need to add an adapter to the new system.
For more information, see the NetApp Hardware Universe.
The following example names are used in this procedure:
-
site_A
-
Before upgrade:
-
node_A_1-old
-
node_A_2-old
-
-
After upgrade:
-
node_A_1-new
-
node_A_2-new
-
-
-
site_B
-
Before upgrade:
-
node_B_1-old
-
node_B_2-old
-
-
After upgrade:
-
node_B_1-new
-
node_B_2-new
-
-
Enable console logging
NetApp strongly recommends that you enable console logging on the devices that you are using and take the following actions when performing this procedure:
-
Leave AutoSupport enabled during maintenance.
-
Trigger a maintenance AutoSupport message before and after maintenance to disable case creation for the duration of the maintenance activity.
See the Knowledge Base article How to suppress automatic case creation during scheduled maintenance windows.
-
Enable session logging for any CLI session. For instructions on how to enable session logging, review the "Logging Session Output" section in the Knowledge Base article How to configure PuTTY for optimal connectivity to ONTAP systems.
Preparing for the upgrade
Before making any changes to the existing MetroCluster configuration, you must check the health of the configuration, prepare the new platforms, and perform other miscellaneous tasks.
Verifying the health of the MetroCluster configuration
You must verify the health and connectivity of the MetroCluster configuration prior to performing the upgrade.
-
Verify the operation of the MetroCluster configuration in ONTAP:
-
Check whether the nodes are multipathed:
node run -node node-name sysconfig -a
You should issue this command for each node in the MetroCluster configuration.
-
Verify that there are no broken disks in the configuration:
storage disk show -broken
You should issue this command on each node in the MetroCluster configuration.
-
Check for any health alerts:
system health alert show
You should issue this command on each cluster.
-
Verify the licenses on the clusters:
system license show
You should issue this command on each cluster.
-
Verify the devices connected to the nodes:
network device-discovery show
You should issue this command on each cluster.
-
Verify that the time zone and time are set correctly on both sites:
cluster date show
You should issue this command on each cluster. You can use the cluster date commands to configure the time and time zone.
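For example, to set the time zone on a cluster (the time zone value shown is only illustrative; use your own location):
cluster date modify -timezone America/New_York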
-
-
Check for any health alerts on the switches (if present):
storage switch show
You should issue this command on each cluster.
-
Confirm the operational mode of the MetroCluster configuration and perform a MetroCluster check.
-
Confirm the MetroCluster configuration and that the operational mode is normal:
metrocluster show
-
Confirm that all expected nodes are shown:
metrocluster node show
-
Issue the following command:
metrocluster check run
-
Display the results of the MetroCluster check:
metrocluster check show
-
-
Check the MetroCluster cabling with the Config Advisor tool.
-
Download and run Config Advisor.
-
After running Config Advisor, review the tool's output and follow the recommendations in the output to address any issues discovered.
-
Mapping ports from the old nodes to the new nodes
You must plan the mapping of the LIFs on physical ports on the old nodes to the physical ports on the new nodes.
When the new node is first booted during the upgrade process, it will replay the most recent configuration of the old node it is replacing. When you boot node_A_1-new, ONTAP attempts to host LIFs on the same ports that were used on node_A_1-old. Therefore, as part of the upgrade you must adjust the port and LIF configuration so it is compatible with that of the old node. During the upgrade procedure, you will perform steps on both the old and new nodes to ensure correct cluster, management, and data LIF configuration.
The following table shows examples of configuration changes related to the port requirements of the new nodes.
Cluster interconnect physical ports

| Old controller | New controller | Required action |
|---|---|---|
| e0a, e0b | e3a, e3b | No matching port. After the upgrade, recreate the cluster ports. See Preparing cluster ports on an existing controller module. |
| e0c, e0d | e0a, e0b, e0c, e0d | e0c and e0d are matching ports. You do not have to change the configuration, but after the upgrade you can spread your cluster LIFs across the available cluster ports. |
-
Determine what physical ports are available on the new controllers and what LIFs can be hosted on the ports.
The controller's port usage depends on the platform module and which switches you will use in the MetroCluster FC configuration. You can gather the port usage of the new platforms from the NetApp Hardware Universe.
Also identify the FC-VI card slot usage.
-
Plan your port usage and, if desired, fill in the following tables for reference for each of the new nodes.
You will refer to the table as you carry out the upgrade procedure.
| LIF | node_A_1-old: Ports | node_A_1-old: IPspaces | node_A_1-old: Broadcast domains | node_A_1-new: Ports | node_A_1-new: IPspaces | node_A_1-new: Broadcast domains |
|---|---|---|---|---|---|---|
| Cluster 1 | | | | | | |
| Cluster 2 | | | | | | |
| Cluster 3 | | | | | | |
| Cluster 4 | | | | | | |
| Node management | | | | | | |
| Cluster management | | | | | | |
| Data 1 | | | | | | |
| Data 2 | | | | | | |
| Data 3 | | | | | | |
| Data 4 | | | | | | |
| SAN | | | | | | |
| Intercluster port | | | | | | |
Gathering information before the upgrade
Before upgrading, you must gather information for each of the old nodes, and, if necessary, adjust the network broadcast domains, remove any VLANs and interface groups, and gather encryption information.
This task is performed on the existing MetroCluster FC configuration.
-
Label the cables for the existing controllers, to allow easy identification of cables when setting up the new controllers.
-
Gather the system IDs of the nodes in the MetroCluster configuration:
metrocluster node show -fields node-systemid,dr-partner-systemid
During the upgrade procedure, you will replace these old system IDs with the system IDs of the new controller modules.
In this example for a four-node MetroCluster FC configuration, the following old system IDs are retrieved:
-
node_A_1-old: 4068741258
-
node_A_2-old: 4068741260
-
node_B_1-old: 4068741254
-
node_B_2-old: 4068741256
metrocluster-siteA::> metrocluster node show -fields node-systemid,ha-partner-systemid,dr-partner-systemid,dr-auxiliary-systemid
dr-group-id cluster   node          node-systemid ha-partner-systemid dr-partner-systemid dr-auxiliary-systemid
----------- --------- ------------- ------------- ------------------- ------------------- ---------------------
1           Cluster_A Node_A_1-old  4068741258    4068741260          4068741256          4068741256
1           Cluster_A Node_A_2-old  4068741260    4068741258          4068741254          4068741254
1           Cluster_B Node_B_1-old  4068741254    4068741256          4068741258          4068741260
1           Cluster_B Node_B_2-old  4068741256    4068741254          4068741260          4068741258
4 entries were displayed.
In this example for a two-node MetroCluster FC configuration, the following old system IDs are retrieved:
-
node_A_1: 4068741258
-
node_B_1: 4068741254
metrocluster node show -fields node-systemid,dr-partner-systemid
dr-group-id cluster   node         node-systemid dr-partner-systemid
----------- --------- ------------ ------------- -------------------
1           Cluster_A Node_A_1-old 4068741258    4068741254
1           Cluster_B node_B_1-old -             -
2 entries were displayed.
-
-
Gather port and LIF information for each old node.
You should gather the output of the following commands for each node:
-
network interface show -role cluster,node-mgmt
-
network port show -node node-name -type physical
-
network port vlan show -node node-name
-
network port ifgrp show -node node_name -instance
-
network port broadcast-domain show
-
network port reachability show -detail
-
network ipspace show
-
volume show
-
storage aggregate show
-
system node run -node node-name sysconfig -a
-
-
If the MetroCluster nodes are in a SAN configuration, collect the relevant information.
You should gather the output of the following commands:
-
fcp adapter show -instance
-
fcp interface show -instance
-
iscsi interface show
-
ucadmin show
-
-
If the root volume is encrypted, collect and save the passphrase used for key-manager:
security key-manager backup show
-
If the MetroCluster nodes are using encryption for volumes or aggregates, copy information about the keys and passphrases.
For additional information, see Backing up onboard key management information manually.
-
If Onboard Key Manager is configured:
security key-manager onboard show-backup
You will need the passphrase later in the upgrade procedure.
-
If enterprise key management (KMIP) is configured, issue the following commands:
security key-manager external show -instance
security key-manager key query
-
Removing the existing configuration from the Tiebreaker or other monitoring software
If the existing configuration is monitored with the MetroCluster Tiebreaker configuration or other third-party applications (for example, ClusterLion) that can initiate a switchover, you must remove the MetroCluster configuration from the Tiebreaker or other software prior to transition.
-
Remove the existing MetroCluster configuration from the Tiebreaker software.
-
Remove the existing MetroCluster configuration from any third-party application that can initiate switchover.
Refer to the documentation for the application.
Sending a custom AutoSupport message prior to maintenance
Before performing the maintenance, you should issue an AutoSupport message to notify NetApp technical support that maintenance is underway. Informing technical support that maintenance is underway prevents them from opening a case on the assumption that a disruption has occurred.
This task must be performed on each MetroCluster site.
-
To prevent automatic support case generation, send an AutoSupport message to indicate maintenance is underway.
-
Issue the following command:
system node autosupport invoke -node * -type all -message MAINT=maintenance-window-in-hours
maintenance-window-in-hours specifies the length of the maintenance window, with a maximum of 72 hours. If the maintenance is completed before the time has elapsed, you can invoke an AutoSupport message indicating the end of the maintenance period:
system node autosupport invoke -node * -type all -message MAINT=end
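For example, to announce an 8-hour maintenance window (the 8-hour value is illustrative; use the length of your planned window):
system node autosupport invoke -node * -type all -message MAINT=8h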
-
Repeat the command on the partner cluster.
-
Switching over the MetroCluster configuration
You must switch over the configuration to site_A so that the platforms on site_B can be upgraded.
This task must be performed on site_A.
After completing this task, cluster_A is active and serving data for both sites. cluster_B is inactive, and ready to begin the upgrade process, as shown in the following illustration.

-
Switch over the MetroCluster configuration to site_A so that site_B's nodes can be upgraded:
-
Select the option that matches your configuration and issue the correct command on cluster_A:
Option 1: Four- or eight-node FC configuration running ONTAP 9.8 or later. Run the command:
metrocluster switchover -controller-replacement true
Option 2: Two-node FC configuration running ONTAP 9.3 and later. Run the command:
metrocluster switchover
The operation can take several minutes to complete.
-
Monitor the switchover operation:
metrocluster operation show
-
After the operation is complete, confirm that the nodes are in switchover state:
metrocluster show
-
Check the status of the MetroCluster nodes:
metrocluster node show
-
-
Heal the data aggregates.
-
Heal the data aggregates:
metrocluster heal data-aggregates
-
Confirm the heal operation is complete by running the
metrocluster operation show
command on the healthy cluster:
cluster_A::> metrocluster operation show
  Operation: heal-aggregates
      State: successful
 Start Time: 7/29/2020 20:54:41
   End Time: 7/29/2020 20:54:42
     Errors: -
-
-
Heal the root aggregates.
-
Heal the root aggregates:
metrocluster heal root-aggregates
-
Confirm the heal operation is complete by running the
metrocluster operation show
command on the healthy cluster:
cluster_A::> metrocluster operation show
  Operation: heal-root-aggregates
      State: successful
 Start Time: 7/29/2020 20:58:41
   End Time: 7/29/2020 20:59:42
     Errors: -
-
Preparing the network configuration of the old controllers
To ensure that the networking resumes cleanly on the new controllers, you must move LIFs to a common port and then remove the networking configuration of the old controllers.
-
This task must be performed on each of the old nodes.
-
You will use the information gathered in Mapping ports from the old nodes to the new nodes.
-
Boot the old nodes and then log in to the nodes:
boot_ontap
-
Assign the home port of all data LIFs on the old controller to a common port that is the same on both the old and new controller modules.
-
Display the LIFs:
network interface show
All data LIFs, including SAN and NAS LIFs, will be administratively up and operationally down, because they are up at the switchover site (cluster_A).
-
Review the output to find a common physical network port that is the same on both the old and new controllers and that is not used as a cluster port.
For example, e0d is a physical port on the old controllers that is also present on the new controllers, and it is not used as a cluster port or for any other purpose on the new controllers.
For port usage for platform models, see the NetApp Hardware Universe.
-
Modify all data LIFS to use the common port as the home port:
network interface modify -vserver svm-name -lif data-lif -home-port port-id
In the following example, this is "e0d".
For example:
network interface modify -vserver vs0 -lif datalif1 -home-port e0d
-
-
Modify broadcast domains to remove the VLAN and physical ports that need to be deleted:
broadcast-domain remove-ports -broadcast-domain broadcast-domain-name -ports node-name:port-id
Repeat this step for all VLAN and physical ports.
-
Remove any VLAN ports that are created on cluster ports, and any interface groups (ifgrps) that use cluster ports as member ports.
-
Delete VLAN ports:
network port vlan delete -node node-name -vlan-name portid-vlandid
For example:
network port vlan delete -node node1 -vlan-name e1c-80
-
Remove physical ports from the interface groups:
network port ifgrp remove-port -node node-name -ifgrp interface-group-name -port portid
For example:
network port ifgrp remove-port -node node1 -ifgrp a1a -port e0d
-
Remove VLAN and interface group ports from the broadcast domain:
network port broadcast-domain remove-ports -ipspace ipspace -broadcast-domain broadcast-domain-name -ports nodename:portname,nodename:portname,..
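For example, assuming the VLAN port and interface group from the previous examples belong to the Default IPspace and Default broadcast domain (hypothetical values; adjust to your environment):
network port broadcast-domain remove-ports -ipspace Default -broadcast-domain Default -ports node1:e1c-80,node1:a1a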
-
Modify interface group ports to use other physical ports as members, as needed:
ifgrp add-port -node node-name -ifgrp interface-group-name -port port-id
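For example, to add a hypothetical physical port e0g to the interface group a1a used in the previous example:
ifgrp add-port -node node1 -ifgrp a1a -port e0g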
-
-
Halt the nodes:
halt -inhibit-takeover true -node node-name
This step must be performed on both nodes.
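For example, on the first old node at site_B:
halt -inhibit-takeover true -node node_B_1-old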
Removing the old platforms
The old controllers must be removed from the configuration.
This task is performed on site_B.
-
Connect to the serial console of the old controllers (node_B_1-old and node_B_2-old) at site_B and verify it is displaying the LOADER prompt.
-
Disconnect the storage and network connections on node_B_1-old and node_B_2-old and label the cables so they can be reconnected to the new nodes.
-
Disconnect the power cables from node_B_1-old and node_B_2-old.
-
Remove the node_B_1-old and node_B_2-old controllers from the rack.
Configuring the new controllers
You must rack and install the controllers, perform required setup in Maintenance mode, and then boot the controllers, and verify the LIF configuration on the controllers.
Setting up the new controllers
You must rack and cable the new controllers.
-
Plan out the positioning of the new controller modules and storage shelves as needed.
The rack space depends on the platform model of the controller modules, the switch types, and the number of storage shelves in your configuration.
-
Properly ground yourself.
-
Install the controller modules in the rack or cabinet.
-
If the new controller modules did not come with FC-VI cards of their own, and if the FC-VI cards from the old controllers are compatible with the new controllers, swap the FC-VI cards and install them in the correct slots.
See the NetApp Hardware Universe for slot information for FC-VI cards.
-
Cable the controllers' power, serial console and management connections as described in the MetroCluster Installation and Configuration Guides.
Do not connect any other cables that were disconnected from old controllers at this time.
-
Power up the new nodes and press Ctrl-C when prompted to display the LOADER prompt.
Netbooting the new controllers
After you install the new nodes, you need to netboot to ensure the new nodes are running the same version of ONTAP as the original nodes. The term netboot means you are booting from an ONTAP image stored on a remote server. When preparing for netboot, you must put a copy of the ONTAP 9 boot image onto a web server that the system can access.
This task is performed on each of the new controller modules.
-
Access the NetApp Support Site to download the files used for performing the netboot of the system.
-
Download the appropriate ONTAP software from the software download section of the NetApp Support Site and store the ontap-version_image.tgz file on a web-accessible directory.
-
Go to the web-accessible directory and verify that the files you need are available.
If the platform model is a FAS/AFF8000 series system:
Extract the contents of the ontap-version_image.tgz file to the target directory:
tar -zxvf ontap-version_image.tgz
NOTE: If you are extracting the contents on Windows, use 7-Zip or WinRAR to extract the netboot image.
Your directory listing should contain a netboot folder with a kernel file: netboot/kernel
If the platform model is any other system:
Your directory listing should contain the following file: ontap-version_image.tgz
You do not need to extract the ontap-version_image.tgz file.
-
At the LOADER prompt, configure the netboot connection for a management LIF:
-
If IP addressing is DHCP, configure the automatic connection:
ifconfig e0M -auto
-
If IP addressing is static, configure the manual connection:
ifconfig e0M -addr=ip_addr -mask=netmask -gw=gateway
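For example, using hypothetical addressing (substitute the values for your management network):
ifconfig e0M -addr=10.10.10.11 -mask=255.255.255.0 -gw=10.10.10.1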
-
-
Perform the netboot.
-
If the platform is an 80xx series system, use this command:
netboot http://web_server_ip/path_to_web-accessible_directory/netboot/kernel
-
If the platform is any other system, use the following command:
netboot http://web_server_ip/path_to_web-accessible_directory/ontap-version_image.tgz
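For example, assuming a hypothetical web server at 192.168.100.10 with the image stored in a directory named ontap_files:
netboot http://192.168.100.10/ontap_files/ontap-version_image.tgz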
-
-
From the boot menu, select option (7) Install new software first to download and install the new software image to the boot device.
Disregard the following message: "This procedure is not supported for Non-Disruptive Upgrade on an HA pair". It applies to nondisruptive upgrades of software, not to upgrades of controllers.
-
If you are prompted to continue the procedure, enter y, and when prompted for the package, enter the URL of the image file:
http://web_server_ip/path_to_web-accessible_directory/ontap-version_image.tgz
Enter username/password if applicable, or press Enter to continue.
-
Be sure to enter n to skip the backup recovery when you see a prompt similar to the following:
Do you want to restore the backup configuration now? {y|n}
-
Reboot by entering y when you see a prompt similar to the following:
The node must be rebooted to start using the newly installed software. Do you want to reboot now? {y|n}
Clearing the configuration on a controller module
Before using a new controller module in the MetroCluster configuration, you must clear the existing configuration.
-
If necessary, halt the node to display the LOADER prompt:
halt
-
At the LOADER prompt, set the environment variables to default values:
set-defaults
-
Save the environment:
saveenv
-
At the LOADER prompt, launch the boot menu:
boot_ontap menu
-
At the boot menu prompt, clear the configuration:
wipeconfig
Respond yes to the confirmation prompt.
The node reboots and the boot menu is displayed again.
-
At the boot menu, select option 5 to boot the system into Maintenance mode.
Respond yes to the confirmation prompt.
Restoring the HBA configuration
Depending on the presence and configuration of HBA cards in the controller module, you need to configure them correctly for your site's usage.
-
In Maintenance mode, configure the settings for any HBAs in the system:
-
Check the current settings of the ports:
ucadmin show
-
Update the port settings as needed.
| If you have this type of HBA and desired mode… | Use this command… |
|---|---|
| CNA FC | ucadmin modify -m fc -t initiator adapter-name |
| CNA Ethernet | ucadmin modify -mode cna adapter-name |
| FC target | fcadmin config -t target adapter-name |
| FC initiator | fcadmin config -t initiator adapter-name |
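For example, to configure a hypothetical CNA adapter 0e for FC initiator mode:
ucadmin modify -m fc -t initiator 0e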
-
-
Exit Maintenance mode:
halt
After you run the command, wait until the node stops at the LOADER prompt.
-
Boot the node back into Maintenance mode to enable the configuration changes to take effect:
boot_ontap maint
-
Verify the changes you made:
| If you have this type of HBA… | Use this command… |
|---|---|
| CNA | ucadmin show |
| FC | fcadmin show |
Setting the HA state on the new controllers and chassis
You must verify the HA state of the controllers and chassis, and, if necessary, update the state to match your system configuration.
-
In Maintenance mode, display the HA state of the controller module and chassis:
ha-config show
The HA state for all components should match your MetroCluster configuration:

| If the MetroCluster configuration has… | The HA state should be… |
|---|---|
| Two nodes | mcc-2n |
| Four or eight nodes | mcc |
-
If the displayed system state of the controller is not correct, set the HA state for the controller module and chassis:
If the MetroCluster configuration has two nodes, issue these commands:
ha-config modify controller mcc-2n
ha-config modify chassis mcc-2n
If the MetroCluster configuration has four or eight nodes, issue these commands:
ha-config modify controller mcc
ha-config modify chassis mcc
Reassigning root aggregate disks
Reassign the root aggregate disks to the new controller module, using the system IDs gathered earlier.
This task is performed in Maintenance mode.
The old system IDs were identified in Gathering information before the upgrade.
The examples in this procedure use controllers with the following system IDs:
| Node | Old system ID | New system ID |
|---|---|---|
| node_B_1 | 4068741254 | 1574774970 |
-
Cable all other connections to the new controller modules (FC-VI, storage, cluster interconnect, etc.).
-
Halt the system and boot to Maintenance mode from the LOADER prompt:
boot_ontap maint
-
Display the disks owned by node_B_1-old:
disk show -a
The command output shows the system ID of the new controller module (1574774970). However, the root aggregate disks are still owned by the old system ID (4068741254). This example does not show drives owned by other nodes in the MetroCluster configuration.
*> disk show -a
Local System ID: 1574774970
DISK           OWNER                     POOL   SERIAL NUMBER   HOME                      DR HOME
------------   -------------             -----  -------------   -------------             -------------
...
rr18:9.126L44  node_B_1-old(4068741254)  Pool1  PZHYN0MD        node_B_1-old(4068741254)  node_B_1-old(4068741254)
rr18:9.126L49  node_B_1-old(4068741254)  Pool1  PPG3J5HA        node_B_1-old(4068741254)  node_B_1-old(4068741254)
rr18:8.126L21  node_B_1-old(4068741254)  Pool1  PZHTDSZD        node_B_1-old(4068741254)  node_B_1-old(4068741254)
rr18:8.126L2   node_B_1-old(4068741254)  Pool0  S0M1J2CF        node_B_1-old(4068741254)  node_B_1-old(4068741254)
rr18:8.126L3   node_B_1-old(4068741254)  Pool0  S0M0CQM5        node_B_1-old(4068741254)  node_B_1-old(4068741254)
rr18:9.126L27  node_B_1-old(4068741254)  Pool0  S0M1PSDW        node_B_1-old(4068741254)  node_B_1-old(4068741254)
...
-
Reassign the root aggregate disks on the drive shelves to the new controller:
disk reassign -s old-sysid -d new-sysid
The following example shows reassignment of drives:
*> disk reassign -s 4068741254 -d 1574774970
Partner node must not be in Takeover mode during disk reassignment from maintenance mode.
Serious problems could result!!
Do not proceed with reassignment if the partner is in takeover mode. Abort reassignment (y/n)? n

After the node becomes operational, you must perform a takeover and giveback of the HA partner node to ensure disk reassignment is successful.
Do you want to continue (y/n)? Jul 14 19:23:49 [localhost:config.bridge.extra.port:error]: Both FC ports of FC-to-SAS bridge rtp-fc02-41-rr18:9.126L0 S/N [FB7500N107692] are attached to this controller.
y
Disk ownership will be updated on all disks previously belonging to Filer with sysid 4068741254.
Do you want to continue (y/n)? y
-
Check that all disks are reassigned as expected:
disk show
*> disk show
Local System ID: 1574774970
DISK           OWNER                     POOL   SERIAL NUMBER   HOME                      DR HOME
------------   -------------             -----  -------------   -------------             -------------
rr18:8.126L18  node_B_1-new(1574774970)  Pool1  PZHYN0MD        node_B_1-new(1574774970)  node_B_1-new(1574774970)
rr18:9.126L49  node_B_1-new(1574774970)  Pool1  PPG3J5HA        node_B_1-new(1574774970)  node_B_1-new(1574774970)
rr18:8.126L21  node_B_1-new(1574774970)  Pool1  PZHTDSZD        node_B_1-new(1574774970)  node_B_1-new(1574774970)
rr18:8.126L2   node_B_1-new(1574774970)  Pool0  S0M1J2CF        node_B_1-new(1574774970)  node_B_1-new(1574774970)
rr18:9.126L29  node_B_1-new(1574774970)  Pool0  S0M0CQM5        node_B_1-new(1574774970)  node_B_1-new(1574774970)
rr18:8.126L1   node_B_1-new(1574774970)  Pool0  S0M1PSDW        node_B_1-new(1574774970)  node_B_1-new(1574774970)
*>
-
Display the aggregate status:
aggr status
*> aggr status
           Aggr State           Status                Options
aggr0_node_b_1-root online      raid_dp, aggr         root, nosnap=on,
                                mirrored              mirror_resync_priority=high(fixed)
                                fast zeroed
                                64-bit
-
Repeat the above steps on the partner node (node_B_2-new).
Booting up the new controllers
You must reboot the controllers from the boot menu to update the controller flash image. Additional steps are required if encryption is configured.
This task must be performed on all the new controllers.
-
Halt the node:
halt
-
If external key manager is configured, set the related bootargs:
setenv bootarg.kmip.init.ipaddr ip-address
setenv bootarg.kmip.init.netmask netmask
setenv bootarg.kmip.init.gateway gateway-address
setenv bootarg.kmip.init.interface interface-id
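For example, with hypothetical values for the KMIP network (substitute your own addresses and interface):
setenv bootarg.kmip.init.ipaddr 10.10.10.20
setenv bootarg.kmip.init.netmask 255.255.255.0
setenv bootarg.kmip.init.gateway 10.10.10.1
setenv bootarg.kmip.init.interface e0M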
-
Display the boot menu:
boot_ontap menu
-
If root encryption is used, depending on the ONTAP version you are using, select the boot menu option or issue the boot menu command for your key management configuration.
ONTAP 9.8 and later: Beginning with ONTAP 9.8, select the boot menu option.

| If you are using… | Select this boot menu option… |
|---|---|
| Onboard key management | Option “10”. Follow the prompts to provide the required inputs to recover and restore the key-manager configuration. |
| External key management | Option “11”. Follow the prompts to provide the required inputs to recover and restore the key-manager configuration. |

ONTAP 9.7 and earlier: For ONTAP 9.7 and earlier, issue the boot menu command.

| If you are using… | Issue this command at the boot menu prompt… |
|---|---|
| Onboard key management | recover_onboard_keymanager |
| External key management | recover_external_keymanager |
-
If autoboot is enabled, interrupt autoboot by pressing CTRL-C.
-
From the boot menu, run option “6”.
Option “6” will reboot the node twice before completing. Respond “y” to the system ID change prompts. Wait for the second reboot messages:
Successfully restored env file from boot media... Rebooting to load the restored env file...
-
Double-check that the partner-sysid is correct:
printenv partner-sysid
If the partner-sysid is not correct, set it:
setenv partner-sysid partner-sysID
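For example, if the partner node's new system ID were 1574774971 (a hypothetical value):
setenv partner-sysid 1574774971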
-
If root encryption is used, depending on the ONTAP version you are using, select the boot menu option or issue the boot menu command again for your key management configuration.
ONTAP 9.8 and later: Beginning with ONTAP 9.8, select the boot menu option.

| If you are using… | Select this boot menu option… |
|---|---|
| Onboard key management | Option “10”. Follow the prompts to provide the required inputs to recover and restore the key-manager configuration. |
| External key management | Option “11”. Follow the prompts to provide the required inputs to recover and restore the key-manager configuration. |

Depending on the key manager setting, perform the recovery procedure by selecting option “10” or option “11”, followed by option “6” at the first boot menu prompt. To boot the nodes completely, you might need to repeat the recovery procedure followed by option “1” (normal boot).

ONTAP 9.7 and earlier: For ONTAP 9.7 and earlier, issue the boot menu command.

| If you are using… | Issue this command at the boot menu prompt… |
|---|---|
| Onboard key management | recover_onboard_keymanager |
| External key management | recover_external_keymanager |

You might need to issue the recover_xxxxxxxx_keymanager command at the boot menu prompt multiple times until the nodes completely boot.
-
Boot the nodes:
boot_ontap
-
Wait for the replaced nodes to boot up.
If either node is in takeover mode, perform a giveback:
storage failover giveback
-
Verify that all ports are in a broadcast domain:
-
View the broadcast domains:
network port broadcast-domain show
-
Add any ports to a broadcast domain as needed.
-
Add the physical port that will host the intercluster LIFs to the corresponding broadcast domain.
-
Modify intercluster LIFs to use the new physical port as home port.
-
After the intercluster LIFs are up, check the cluster peer status and re-establish cluster peering as needed.
You might need to reconfigure cluster peering.
-
Recreate VLANs and interface groups as needed.
VLAN and interface group membership might be different than that of the old node.
-
-
If encryption is used, restore the keys using the correct command for your key management configuration.
If you are using…
Use this command…
Onboard key management
security key-manager onboard sync
For more information, see Restoring onboard key management encryption keys.
External key management
security key-manager external restore -vserver SVM -node node -key-server host_name|IP_address:port -key-id key_id -key-tag key_tag node-name
For more information, see Restoring external key management encryption keys.
Verifying LIF configuration
Verify that LIFs are hosted on the appropriate nodes and ports prior to switchback by performing the following steps.
This task is performed on site_B, where the nodes have been booted up with root aggregates.
-
Verify that LIFs are hosted on the appropriate node and ports prior to switchback.
-
Change to the advanced privilege level:
set -privilege advanced
-
Override the port configuration to ensure proper LIF placement:
vserver config override -command "network interface modify -vserver vserver_name -home-port active_port_after_upgrade -lif lif_name -home-node new_node_name"
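For example, using hypothetical values drawn from the earlier examples in this procedure (SVM vs0, LIF datalif1, common port e0d, new node node_B_1-new):
vserver config override -command "network interface modify -vserver vs0 -home-port e0d -lif datalif1 -home-node node_B_1-new"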
When entering the network interface modify command within the vserver config override command, you cannot use the tab autocomplete feature. You can create the network interface modify command using autocomplete and then enclose it in the vserver config override command.
-
Return to the admin privilege level:
set -privilege admin
-
-
Revert the interfaces to their home node:
network interface revert * -vserver vserver-name
Perform this step on all SVMs as required.
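For example, for a hypothetical SVM named vs0:
network interface revert * -vserver vs0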
Installing the new licenses
Before the switchback operation, you must install licenses for the new controllers.
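For example, you can add each license key with the system license add command (the key shown is only a placeholder for a key obtained from NetApp Support):
system license add -license-code AAAAAAAAAAAAAAAAAAAAAAAAAAAA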
Switching back the MetroCluster configuration
After the new controllers have been configured, you switch back the MetroCluster configuration to return the configuration to normal operation.
In this task, you will perform the switchback operation, returning the MetroCluster configuration to normal operation. The nodes on site_A are still awaiting upgrade.

-
Issue the
metrocluster node show
command on site_B and check the output.
-
Verify that the new nodes are represented correctly.
-
Verify that the new nodes are in the "Waiting for switchback" state.
-
-
Switch back the cluster:
metrocluster switchback
-
Check the progress of the switchback operation:
metrocluster show
The switchback operation is still in progress when the output displays waiting-for-switchback:
cluster_B::> metrocluster show
Cluster                   Entry Name          State
------------------------- ------------------- -----------
 Local: cluster_B         Configuration state configured
                          Mode                switchover
                          AUSO Failure Domain -
Remote: cluster_A         Configuration state configured
                          Mode                waiting-for-switchback
                          AUSO Failure Domain -
The switchback operation is complete when the output displays normal:
cluster_B::> metrocluster show
Cluster                   Entry Name          State
------------------------- ------------------- -----------
 Local: cluster_B         Configuration state configured
                          Mode                normal
                          AUSO Failure Domain -
Remote: cluster_A         Configuration state configured
                          Mode                normal
                          AUSO Failure Domain -
If a switchback takes a long time to finish, you can check on the status of in-progress baselines by using the
metrocluster config-replication resync-status show
command. This command is at the advanced privilege level.
Checking the health of the MetroCluster configuration
After upgrading the controller modules you must verify the health of the MetroCluster configuration.
This task can be performed on any node in the MetroCluster configuration.
-
Verify the operation of the MetroCluster configuration:
-
Confirm the MetroCluster configuration and that the operational mode is normal:
metrocluster show
-
Perform a MetroCluster check:
metrocluster check run
-
Display the results of the MetroCluster check:
metrocluster check show
After you run metrocluster check run and metrocluster check show, you might see an error message similar to the following:
Failed to validate the node and cluster components before the switchover operation.
Cluster_A:: node_A_1 (non-overridable veto): DR partner NVLog mirroring is not online. Make sure that the links between the two sites are healthy and properly configured.
This is expected behavior due to a controller mismatch during the upgrade process, and the error message can be safely ignored.
-
Upgrading the nodes on cluster_A
You must repeat the upgrade tasks on cluster_A.
-
Repeat the steps to upgrade the nodes on cluster_A, beginning with Preparing for the upgrade.
As you perform the tasks, all example references to the clusters and nodes are reversed. For example, when the example is given to switchover from cluster_A, you will switchover from cluster_B.
Sending a custom AutoSupport message after maintenance
After completing the upgrade, you should send an AutoSupport message indicating the end of maintenance, so automatic case creation can resume.
-
To resume automatic support case generation, send an AutoSupport message to indicate that the maintenance is complete.
-
Issue the following command:
system node autosupport invoke -node * -type all -message MAINT=end
-
Repeat the command on the partner cluster.
-
Restoring Tiebreaker monitoring
If the MetroCluster configuration was previously configured for monitoring by the Tiebreaker software, you can restore the Tiebreaker connection.
-
Use the steps in Adding MetroCluster configurations in MetroCluster Tiebreaker Installation and Configuration.