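Replace a Cisco Nexus 9336C-FX2 shared switch
You can replace a defective Nexus 9336C-FX2 shared switch. This is a nondisruptive procedure (NDU).
Before replacing the switch, make sure that: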
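- In the existing cluster and network infrastructure:
  - The existing cluster is verified as completely functional, with at least one fully connected cluster switch.
  - All cluster ports are up.
  - All cluster logical interfaces (LIFs) are up and on their home ports.
  - The ONTAP `cluster ping-cluster -node node1` command indicates that basic connectivity and larger than PMTU communication are successful on all paths.
- For the Nexus 9336C-FX2 replacement switch:
  - Management network connectivity on the replacement switch is functional.
  - Console access to the replacement switch is in place.
  - The node connections are ports 1/1 through 1/34.
  - All Inter-Switch Link (ISL) ports are disabled on ports 1/35 and 1/36.
  - The desired reference configuration file (RCF) and NX-OS operating system image are loaded onto the switch.
  - Any previous site customizations, such as STP, SNMP, and SSH, are copied to the new switch.
You must run the command for migrating a cluster LIF from the node where the cluster LIF is hosted.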
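The examples in this procedure use the following switch and node nomenclature:
- The names of the existing Nexus 9336C-FX2 switches are sh1 and sh2.
- The names of the new Nexus 9336C-FX2 switches are newsh1 and newsh2.
- The node names are node1 and node2.
- The cluster ports on each node are named _e3a_ and _e3b_.
- The cluster LIF names are `node1_clus1` and `node1_clus2` for node1, and `node2_clus1` and `node2_clus2` for node2.
- The prompt for changes to all cluster nodes is cluster1::*>.
The following procedure is based on this network topology: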
cluster1::*> network port show -ipspace Cluster

Node: node1
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false

Node: node2
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false
4 entries were displayed.

cluster1::*> network interface show -vserver Cluster
            Logical     Status     Network            Current       Current Is
Vserver     Interface   Admin/Oper Address/Mask       Node          Port    Home
----------- ----------- ---------- ------------------ ------------- ------- ----
Cluster
            node1_clus1 up/up      169.254.209.69/16  node1         e3a     true
            node1_clus2 up/up      169.254.49.125/16  node1         e3b     true
            node2_clus1 up/up      169.254.47.194/16  node2         e3a     true
            node2_clus2 up/up      169.254.19.183/16  node2         e3b     true
4 entries were displayed.

cluster1::*> network device-discovery show -protocol cdp
Node/       Local  Discovered
Protocol    Port   Device (LLDP: ChassisID)  Interface        Platform
----------- ------ ------------------------- ---------------- ----------------
node2      /cdp
            e3a    sh1                       Eth1/2           N9K-C9336C
            e3b    sh2                       Eth1/2           N9K-C9336C
node1      /cdp
            e3a    sh1                       Eth1/1           N9K-C9336C
            e3b    sh2                       Eth1/1           N9K-C9336C
4 entries were displayed.

sh1# show cdp neighbors
Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge
                  S - Switch, H - Host, I - IGMP, r - Repeater,
                  V - VoIP-Phone, D - Remotely-Managed-Device,
                  s - Supports-STP-Dispute
Device-ID          Local Intrfce  Hldtme Capability  Platform      Port ID
node1              Eth1/1         144    H           FAS2980       e3a
node2              Eth1/2         145    H           FAS2980       e3a
sh2                Eth1/35        176    R S I s     N9K-C9336C    Eth1/35
sh2 (FDO220329V5)  Eth1/36        176    R S I s     N9K-C9336C    Eth1/36
Total entries displayed: 4

sh2# show cdp neighbors
Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge
                  S - Switch, H - Host, I - IGMP, r - Repeater,
                  V - VoIP-Phone, D - Remotely-Managed-Device,
                  s - Supports-STP-Dispute
Device-ID          Local Intrfce  Hldtme Capability  Platform      Port ID
node1              Eth1/1         139    H           FAS2980       e3b
node2              Eth1/2         124    H           FAS2980       e3b
sh1                Eth1/35        178    R S I s     N9K-C9336C    Eth1/35
sh1                Eth1/36        178    R S I s     N9K-C9336C    Eth1/36
Total entries displayed: 4
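- If AutoSupport is enabled on this cluster, suppress automatic case creation by invoking an AutoSupport message:
  system node autosupport invoke -node * -type all -message MAINT=xh
  where x is the duration of the maintenance window in hours.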
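To illustrate the `MAINT=xh` format, the following Python sketch builds the command string for a given window length. The helper name `maintenance_command` is hypothetical and is not part of ONTAP; it only formats the command shown in this step and does not run it.

```python
def maintenance_command(hours: int) -> str:
    """Return the AutoSupport command that opens a maintenance window.

    `hours` is the planned duration of the maintenance window in hours.
    Illustrative helper only: it formats the command text from this
    procedure and never contacts the cluster.
    """
    if hours <= 0:
        raise ValueError("maintenance window must be at least 1 hour")
    return (
        "system node autosupport invoke -node * -type all "
        f"-message MAINT={hours}h"
    )


# Example: a two-hour maintenance window.
print(maintenance_command(2))
```

Paste the printed command at the cluster prompt; choose a duration that covers the whole replacement.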
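- Optional: Install the appropriate RCF and image on the switch, newsh2, and make any necessary site preparations.
  - If necessary, verify, download, and install the appropriate versions of the RCF and NX-OS software for the new switch. If you have verified that the new switch is correctly set up and does not need updates to the RCF and NX-OS software, continue to Step 3.
  - Go to the NetApp Cluster and Management Network Switches Reference Configuration File Description Page on the NetApp Support Site.
  - Click the link for the Cluster Network and Management Network Compatibility Matrix, and then note the required switch software version.
  - Click your browser's back arrow to return to the Description page, click CONTINUE, accept the license agreement, and then go to the Download page.
  - Follow the steps on the Download page to download the correct RCF and NX-OS files for the version of ONTAP software you are installing.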
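- On the new switch, log in as admin and shut down all of the ports that will be connected to the node cluster interfaces (ports 1/1 to 1/34). If the switch that you are replacing is not functional and is powered down, go to Step 4. The LIFs on the cluster nodes should have already failed over to the other cluster port for each node.
newsh2# config
Enter configuration commands, one per line. End with CNTL/Z.
newsh2(config)# interface e1/1-34
newsh2(config-if-range)# shutdown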
- Verify that all cluster LIFs have auto-revert enabled:
  network interface show -vserver Cluster -fields auto-revert
cluster1::> network interface show -vserver Cluster -fields auto-revert
             Logical
Vserver      Interface     Auto-revert
------------ ------------- -------------
Cluster      node1_clus1   true
Cluster      node1_clus2   true
Cluster      node2_clus1   true
Cluster      node2_clus2   true
4 entries were displayed.
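If you capture this output to text, a check such as the following Python sketch can list any LIF that does not report auto-revert as true. The function name `lifs_without_auto_revert` is hypothetical, and the parser assumes the three-column layout shown in the example above; adjust it if your output differs.

```python
from typing import List


def lifs_without_auto_revert(output: str) -> List[str]:
    """Return cluster LIF names whose Auto-revert column is not 'true'."""
    failing = []
    for line in output.splitlines():
        fields = line.split()
        # Data rows have three columns: vserver, LIF name, auto-revert value.
        if len(fields) == 3 and fields[0] == "Cluster":
            if fields[2].lower() != "true":
                failing.append(fields[1])
    return failing


SAMPLE = """\
             Logical
Vserver      Interface     Auto-revert
------------ ------------- -------------
Cluster      node1_clus1   true
Cluster      node1_clus2   true
Cluster      node2_clus1   true
Cluster      node2_clus2   true
4 entries were displayed.
"""

# An empty list means every cluster LIF has auto-revert enabled.
print(lifs_without_auto_revert(SAMPLE))
```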
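- Verify the connectivity of the remote cluster interfaces:
  You can use the `network interface check cluster-connectivity` command to start an accessibility check for cluster connectivity and then display the details:
  `network interface check cluster-connectivity start` and `network interface check cluster-connectivity show`
cluster1::*> network interface check cluster-connectivity start
*NOTE:* Wait a few seconds before running the `show` command to display the details.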
cluster1::*> network interface check cluster-connectivity show
Source Destination Packet
Node Date LIF LIF Loss
------ -------------------------- ---------------- ---------------- -----------
node1
3/5/2022 19:21:18 -06:00 node1_clus2 node2_clus1 none
3/5/2022 19:21:20 -06:00 node1_clus2 node2_clus2 none
node2
3/5/2022 19:21:18 -06:00 node2_clus2 node1_clus1 none
3/5/2022 19:21:20 -06:00 node2_clus2 node1_clus2 none
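The Packet Loss column can also be checked programmatically. The Python sketch below is a minimal example under the assumption that result rows have the six whitespace-separated fields shown above (date, time, UTC offset, source LIF, destination LIF, loss); the function name `lossy_paths` and the `50%` value used in the usage note are hypothetical.

```python
from typing import List, Tuple


def lossy_paths(output: str) -> List[Tuple[str, str, str]]:
    """Return (source LIF, destination LIF, loss) for rows not reporting 'none'."""
    problems = []
    for line in output.splitlines():
        fields = line.split()
        # Result rows: date, time, UTC offset, source LIF, destination LIF, loss.
        if len(fields) == 6 and fields[0].count("/") == 2:
            source, destination, loss = fields[3:]
            if loss != "none":
                problems.append((source, destination, loss))
    return problems


SAMPLE = """\
node1
 3/5/2022 19:21:18 -06:00 node1_clus2 node2_clus1 none
 3/5/2022 19:21:20 -06:00 node1_clus2 node2_clus2 none
node2
 3/5/2022 19:21:18 -06:00 node2_clus2 node1_clus1 none
 3/5/2022 19:21:20 -06:00 node2_clus2 node1_clus2 none
"""

# An empty list means no path reported packet loss.
print(lossy_paths(SAMPLE))
```

Any row whose last field is something other than `none`, for example a made-up value such as `50%`, is returned so that you can investigate that path before continuing.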
For all ONTAP releases, you can also use the `cluster ping-cluster -node <name>` command to check the connectivity:
cluster ping-cluster -node <name>
cluster1::*> cluster ping-cluster -node local
Host is node2
Getting addresses from network interface table...
Cluster node1_clus1 169.254.209.69 node1 e3a
Cluster node1_clus2 169.254.49.125 node1 e3b
Cluster node2_clus1 169.254.47.194 node2 e3a
Cluster node2_clus2 169.254.19.183 node2 e3b
Local = 169.254.47.194 169.254.19.183
Remote = 169.254.209.69 169.254.49.125
Cluster Vserver Id = 4294967293
Ping status:
....
Basic connectivity succeeds on 4 path(s)
Basic connectivity fails on 0 path(s)
................
Detected 9000 byte MTU on 4 path(s):
    Local 169.254.47.194 to Remote 169.254.209.69
    Local 169.254.47.194 to Remote 169.254.49.125
    Local 169.254.19.183 to Remote 169.254.209.69
    Local 169.254.19.183 to Remote 169.254.49.125
Larger than PMTU communication succeeds on 4 path(s)
RPC status:
2 paths up, 0 paths down (tcp check)
2 paths up, 0 paths down (udp check)
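The summary lines of the `cluster ping-cluster` output carry the pass/fail information. As a rough sketch, the following Python function checks those lines in captured output; the name `ping_cluster_ok` is hypothetical, and the patterns assume the wording shown in the example above.

```python
import re


def ping_cluster_ok(output: str) -> bool:
    """Return True when the summary lines report no failed or down paths."""
    succeeded = re.search(r"Basic connectivity succeeds on (\d+) path\(s\)", output)
    failed = re.search(r"Basic connectivity fails on (\d+) path\(s\)", output)
    pmtu = re.search(
        r"Larger than PMTU communication succeeds on (\d+) path\(s\)", output
    )
    if not (succeeded and failed and pmtu):
        return False  # an expected summary line is missing
    paths = int(succeeded.group(1))
    # Each RPC check line reports "<n> paths up, <m> paths down".
    rpc_down = [
        int(down)
        for _up, down in re.findall(r"(\d+) paths up, (\d+) paths down", output)
    ]
    return (
        paths > 0
        and int(failed.group(1)) == 0
        and int(pmtu.group(1)) == paths
        and all(down == 0 for down in rpc_down)
    )


SAMPLE = """\
Basic connectivity succeeds on 4 path(s)
Basic connectivity fails on 0 path(s)
Detected 9000 byte MTU on 4 path(s):
Larger than PMTU communication succeeds on 4 path(s)
RPC status:
2 paths up, 0 paths down (tcp check)
2 paths up, 0 paths down (udp check)
"""

print(ping_cluster_ok(SAMPLE))
```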
- Shut down the ISL ports 1/35 and 1/36 on the Nexus 9336C-FX2 switch sh1.
sh1# configure
Enter configuration commands, one per line. End with CNTL/Z.
sh1(config)# interface e1/35-36
sh1(config-if-range)# shutdown
- Remove all of the cables from the Nexus 9336C-FX2 sh2 switch, and then connect them to the same ports on the Nexus 9336C-FX2 newsh2 switch.
- Bring up ISL ports 1/35 and 1/36 between the sh1 and newsh2 switches, and then verify the port channel operation status.
  Port-Channel should indicate Po1(SU), and Member Ports should indicate Eth1/35(P) and Eth1/36(P).
  This example enables ISL ports 1/35 and 1/36 and displays the port channel summary on switch sh1.
sh1# configure
Enter configuration commands, one per line. End with CNTL/Z.
sh1(config)# int e1/35-36
sh1(config-if-range)# no shutdown
sh1(config-if-range)# show port-channel summary
Flags:  D - Down        P - Up in port-channel (members)
        I - Individual  H - Hot-standby (LACP only)
        s - Suspended   r - Module-removed
        b - BFD Session Wait
        S - Switched    R - Routed
        U - Up (port-channel)
        p - Up in delay-lacp mode (member)
        M - Not in use. Min-links not met
--------------------------------------------------------------------------------
Group Port-       Type     Protocol  Member Ports
      Channel
--------------------------------------------------------------------------------
1     Po1(SU)     Eth      LACP      Eth1/35(P)   Eth1/36(P)
sh1(config-if-range)#
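The expected state, Po1(SU) with every member flagged (P), can be verified from captured `show port-channel summary` output. The Python sketch below is one possible check; the function name `port_channel_healthy` is hypothetical, and it assumes the row layout shown in the example above.

```python
import re


def port_channel_healthy(output: str, channel: str = "Po1") -> bool:
    """Return True if `channel` is flagged (SU) and all members are flagged (P)."""
    for line in output.splitlines():
        fields = line.split()
        # Summary row: group, port-channel(flags), type, protocol, members...
        if len(fields) >= 5 and fields[1].startswith(channel + "("):
            if fields[1] != f"{channel}(SU)":
                return False
            members = fields[4:]
            return all(re.fullmatch(r"Eth\d+/\d+\(P\)", m) for m in members)
    return False  # the port channel was not listed at all


SAMPLE = """\
Group Port-       Type     Protocol  Member Ports
      Channel
1     Po1(SU)     Eth      LACP      Eth1/35(P)   Eth1/36(P)
"""

print(port_channel_healthy(SAMPLE))
```

A member shown with any other flag, such as (D) for down or (s) for suspended, makes the check fail, which matches the flag legend printed by the switch.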
- Step 9: Verify that port e3b is up on all nodes:
  network port show -ipspace Cluster
  The output should be similar to the following:
cluster1::*> network port show -ipspace Cluster

Node: node1
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false

Node: node2
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/auto    -        false
4 entries were displayed.
- On the same node that you used in the previous step, revert the cluster LIF associated with the port in the previous step by using the network interface revert command.
  In this example, LIF node1_clus2 on node1 is successfully reverted if the Home value is true and the port is e3b.
  The following example displays information about the LIFs on both nodes after LIF node1_clus2 on node1 is returned to its home port e3b. Bringing up the first node is successful if the Is Home column is true for both of its cluster interfaces and they show the correct port assignments, in this example e3a and e3b on node1.
cluster1::*> network interface show -vserver Cluster
            Logical     Status     Network            Current       Current Is
Vserver     Interface   Admin/Oper Address/Mask       Node          Port    Home
----------- ----------- ---------- ------------------ ------------- ------- ----
Cluster
            node1_clus1 up/up      169.254.209.69/16  node1         e3a     true
            node1_clus2 up/up      169.254.49.125/16  node1         e3b     true
            node2_clus1 up/up      169.254.47.194/16  node2         e3a     true
            node2_clus2 up/up      169.254.19.183/16  node2         e3a     false
4 entries were displayed.
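To see at a glance which LIFs still need to be reverted, you could scan the captured `network interface show` output for rows whose Is Home value is not true. The Python sketch below assumes the six-column row layout shown above; the function name `lifs_not_home` is hypothetical.

```python
import re
from typing import Dict


def lifs_not_home(output: str) -> Dict[str, str]:
    """Map each LIF that is away from its home port to its current port."""
    away = {}
    for line in output.splitlines():
        fields = line.split()
        # LIF rows: name, admin/oper status, address, node, port, is-home.
        if len(fields) == 6 and re.fullmatch(r"(up|down)/(up|down)", fields[1]):
            lif, _status, _address, _node, port, is_home = fields
            if is_home != "true":
                away[lif] = port
    return away


SAMPLE = """\
            Logical     Status     Network            Current       Current Is
Vserver     Interface   Admin/Oper Address/Mask       Node          Port    Home
Cluster
            node1_clus1 up/up      169.254.209.69/16  node1         e3a     true
            node1_clus2 up/up      169.254.49.125/16  node1         e3b     true
            node2_clus1 up/up      169.254.47.194/16  node2         e3a     true
            node2_clus2 up/up      169.254.19.183/16  node2         e3a     false
"""

print(lifs_not_home(SAMPLE))  # {'node2_clus2': 'e3a'}
```

In this sample, node1 is fully reverted while node2_clus2 is still hosted on e3a, so the revert must still be run on node2.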
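- Display information about the nodes in the cluster:
  cluster show
  This example shows that the node health for node1 and node2 in this cluster is true:
cluster1::*> cluster show
Node          Health  Eligibility
------------- ------- ------------
node1         true    true
node2         true    true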
- Step 12: Verify that all physical cluster ports are up:
  network port show -ipspace Cluster
cluster1::*> network port show -ipspace Cluster

Node: node1
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false

Node: node2
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false
4 entries were displayed.
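- Verify the connectivity of the remote cluster interfaces:
  You can use the `network interface check cluster-connectivity` command to start an accessibility check for cluster connectivity and then display the details:
  `network interface check cluster-connectivity start` and `network interface check cluster-connectivity show`
cluster1::*> network interface check cluster-connectivity start
*NOTE:* Wait a few seconds before running the `show` command to display the details.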
cluster1::*> network interface check cluster-connectivity show
Source Destination Packet
Node Date LIF LIF Loss
------ -------------------------- ---------------- ---------------- -----------
node1
3/5/2022 19:21:18 -06:00 node1_clus2 node2_clus1 none
3/5/2022 19:21:20 -06:00 node1_clus2 node2_clus2 none
node2
3/5/2022 19:21:18 -06:00 node2_clus2 node1_clus1 none
3/5/2022 19:21:20 -06:00 node2_clus2 node1_clus2 none
For all ONTAP releases, you can also use the `cluster ping-cluster -node <name>` command to check the connectivity:
cluster ping-cluster -node <name>
cluster1::*> cluster ping-cluster -node local
Host is node2
Getting addresses from network interface table...
Cluster node1_clus1 169.254.209.69 node1 e3a
Cluster node1_clus2 169.254.49.125 node1 e3b
Cluster node2_clus1 169.254.47.194 node2 e3a
Cluster node2_clus2 169.254.19.183 node2 e3b
Local = 169.254.47.194 169.254.19.183
Remote = 169.254.209.69 169.254.49.125
Cluster Vserver Id = 4294967293
Ping status:
....
Basic connectivity succeeds on 4 path(s)
Basic connectivity fails on 0 path(s)
................
Detected 9000 byte MTU on 4 path(s):
    Local 169.254.47.194 to Remote 169.254.209.69
    Local 169.254.47.194 to Remote 169.254.49.125
    Local 169.254.19.183 to Remote 169.254.209.69
    Local 169.254.19.183 to Remote 169.254.49.125
Larger than PMTU communication succeeds on 4 path(s)
RPC status:
2 paths up, 0 paths down (tcp check)
2 paths up, 0 paths down (udp check)
- Confirm the following cluster network configuration:
  network port show
cluster1::*> network port show -ipspace Cluster

Node: node1
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false

Node: node2
                                                                        Ignore
                                                  Speed(Mbps)  Health   Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper   Status   Status
--------- ------------ ---------------- ---- ---- ------------ -------- ------
e3a       Cluster      Cluster          up   9000 auto/100000  healthy  false
e3b       Cluster      Cluster          up   9000 auto/100000  healthy  false
4 entries were displayed.

cluster1::*> network interface show -vserver Cluster
            Logical     Status     Network            Current       Current Is
Vserver     Interface   Admin/Oper Address/Mask       Node          Port    Home
----------- ----------- ---------- ------------------ ------------- ------- ----
Cluster
            node1_clus1 up/up      169.254.209.69/16  node1         e3a     true
            node1_clus2 up/up      169.254.49.125/16  node1         e3b     true
            node2_clus1 up/up      169.254.47.194/16  node2         e3a     true
            node2_clus2 up/up      169.254.19.183/16  node2         e3b     true
4 entries were displayed.

cluster1::> network device-discovery show -protocol cdp
Node/       Local  Discovered
Protocol    Port   Device (LLDP: ChassisID)  Interface        Platform
----------- ------ ------------------------- ---------------- ----------------
node2      /cdp
            e3a    sh1                       Eth1/2           N9K-C9336C
            e3b    newsh2                    Eth1/2           N9K-C9336C
node1      /cdp
            e3a    sh1                       Eth1/1           N9K-C9336C
            e3b    newsh2                    Eth1/1           N9K-C9336C
4 entries were displayed.

sh1# show cdp neighbors
Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge
                  S - Switch, H - Host, I - IGMP, r - Repeater,
                  V - VoIP-Phone, D - Remotely-Managed-Device,
                  s - Supports-STP-Dispute
Device-ID          Local Intrfce  Hldtme Capability  Platform      Port ID
node1              Eth1/1         144    H           FAS2980       e3a
node2              Eth1/2         145    H           FAS2980       e3a
newsh2             Eth1/35        176    R S I s     N9K-C9336C    Eth1/35
newsh2             Eth1/36        176    R S I s     N9K-C9336C    Eth1/36
Total entries displayed: 4

newsh2# show cdp neighbors
Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge
                  S - Switch, H - Host, I - IGMP, r - Repeater,
                  V - VoIP-Phone, D - Remotely-Managed-Device,
                  s - Supports-STP-Dispute
Device-ID          Local Intrfce  Hldtme Capability  Platform      Port ID
node1              Eth1/1         139    H           FAS2980       e3b
node2              Eth1/2         124    H           FAS2980       e3b
sh1                Eth1/35        178    R S I s     N9K-C9336C    Eth1/35
sh1                Eth1/36        178    R S I s     N9K-C9336C    Eth1/36
Total entries displayed: 4
- Verify that the storage attached to HA pair 1, shared switch newsh2, is healthy.
- Verify that the storage attached to HA pair 2, shared switch newsh2, is healthy:
  storage port show -port-type ENET
storage::*> storage port show -port-type ENET
                                   Speed                           VLAN
Node    Port    Type    Mode       (Gb/s)      State     Status    ID
------- ------- ------- ---------- ----------- --------- --------- -----
node1
        e3a     ENET    storage    100         enabled   online    30
        e3b     ENET    storage    0           enabled   offline   30
        e7a     ENET    storage    0           enabled   offline   30
        e7b     ENET    storage    100         enabled   online    30
node2
        e3a     ENET    storage    100         enabled   online    30
        e3b     ENET    storage    0           enabled   offline   30
        e7a     ENET    storage    0           enabled   offline   30
        e7b     ENET    storage    100         enabled   online    30
- Step 18: Verify that the shelves are correctly cabled:
  storage shelf port show -fields remote-device,remote-port
cluster1::*> storage shelf port show -fields remote-device,remote-port
shelf id remote-port  remote-device
----- -- ------------ ----------------------------
3.20  0  Ethernet1/13 sh1
3.20  1  Ethernet1/13 newsh2
3.20  2  Ethernet1/14 sh1
3.20  3  Ethernet1/14 newsh2
3.30  0  Ethernet1/15 sh1
3.30  1  Ethernet1/15 newsh2
3.30  2  Ethernet1/16 sh1
3.30  3  Ethernet1/16 newsh2
8 entries were displayed.
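One way to read this output is to confirm that every shelf is cabled to both shared switches. The Python sketch below illustrates that check on captured output; the function name `shelves_missing_switch` is hypothetical, and it assumes the four-column layout shown above with shelf IDs in the form `3.20`.

```python
import re
from collections import defaultdict
from typing import Dict, Set


def shelves_missing_switch(output: str, switches: Set[str]) -> Dict[str, Set[str]]:
    """Map each shelf to the expected switches it has no port connected to."""
    seen = defaultdict(set)
    for line in output.splitlines():
        fields = line.split()
        # Data rows: shelf, port id, remote port, remote device.
        if (
            len(fields) == 4
            and re.fullmatch(r"\d+\.\d+", fields[0])
            and fields[1].isdigit()
        ):
            shelf, _port_id, _remote_port, remote_device = fields
            seen[shelf].add(remote_device)
    return {
        shelf: switches - devices
        for shelf, devices in seen.items()
        if switches - devices
    }


SAMPLE = """\
shelf id remote-port  remote-device
----- -- ------------ ----------------------------
3.20  0  Ethernet1/13 sh1
3.20  1  Ethernet1/13 newsh2
3.30  0  Ethernet1/15 sh1
3.30  1  Ethernet1/15 newsh2
8 entries were displayed.
"""

# An empty dict means every shelf reaches both sh1 and newsh2.
print(shelves_missing_switch(SAMPLE, {"sh1", "newsh2"}))
```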
- Repeat these steps for switch sh1 and the new switch newsh1.
- If you suppressed automatic case creation, re-enable it by invoking an AutoSupport message:
  system node autosupport invoke -node * -type all -message MAINT=END
After replacing the switch, you can configure switch health monitoring.