将Cisco集群交换机迁移到NVIDIA SN2100 集群交换机
您可以将用于ONTAP集群的Cisco集群交换机无中断地迁移到NVIDIA SN2100 集群交换机。当您用NVIDIA SN2100 集群交换机替换一些较旧的Cisco集群交换机时,您必须了解某些配置信息、端口连接和布线要求。
支持以下Cisco集群交换机:
-
Nexus 9336C-FX2
-
Nexus 92300YC
-
Nexus 5596UP
-
Nexus 3232C
-
Nexus 3132Q-V
您可以将ONTAP集群中较旧的Cisco集群交换机无中断地迁移到NVIDIA SN2100 集群交换机。
-
现有集群必须已正确设置并正常运行。
-
为确保运行不中断,所有集群端口必须处于启用状态。
-
NVIDIA SN2100 集群交换机必须配置并在安装了正确版本的 Cumulus Linux 的环境下运行,并且应用了参考配置文件 (RCF)。
-
现有集群网络配置必须包含以下内容:
-
使用较旧的Cisco交换机构建冗余且功能齐全的NetApp集群。
-
对旧款Cisco交换机和新款交换机的管理连接和控制台访问。
-
所有处于启动状态的集群 LIF 都位于其主端口上。
-
ISL 端口已启用,并且已在旧款Cisco交换机和新款交换机之间连接了线缆。
-
-
参见 "Hardware Universe"有关支持的端口及其配置的完整详细信息。
-
您已将NVIDIA SN2100 交换机上的一些端口配置为以 40 GbE 或 100 GbE 运行。
-
您已规划、迁移并记录了从节点到NVIDIA SN2100 集群交换机的 40 GbE 和 100 GbE 连接。
在此过程中,使用Cisco Nexus 3232C 集群交换机作为示例命令和输出。
本流程中的示例使用以下开关和节点命名规则:
-
现有的Cisco Nexus 3232C 集群交换机是 c1 和 c2。
-
新的NVIDIA SN2100 集群交换机是 sw1 和 sw2。
-
节点分别为 node1 和 node2。
-
节点 1 上的集群 LIF 分别为 node1_clus1 和 node1_clus2,节点 2 上的集群 LIF 分别为 node2_clus1 和 node2_clus2。
-
这 `cluster1::*>`prompt 指示集群名称。
-
此过程中使用的集群端口为 e3a 和 e3b。
-
分支端口采用以下格式:swp[端口]s[分支端口 0-3]。例如,swp1 上的四个分支端口分别是 swp1s0、swp1s1、swp1s2 和 swp1s3。
-
首先将交换机 c2 替换为交换机 sw2,然后将交换机 c1 替换为交换机 sw1。
-
然后断开节点与 c2 之间的电缆与 c2 的连接,并重新连接到 sw2。
-
然后断开节点与 c1 之间的电缆与 c1 的连接,并重新连接到 sw1。
-
-
如果此集群上启用了AutoSupport ,则通过调用AutoSupport消息来抑制自动创建案例:
system node autosupport invoke -node * -type all -message MAINT=xh其中 x 为维护窗口的持续时间(小时)。
-
将权限级别更改为高级,并在提示继续时输入 y:
set -privilege advanced出现高级提示符(*>)。
-
禁用集群 LIF 的自动回滚功能:
network interface modify -vserver Cluster -lif * -auto-revert falsecluster1::*> network interface modify -vserver Cluster -lif * -auto-revert false Warning: Disabling the auto-revert feature of the cluster logical interface may effect the availability of your cluster network. Are you sure you want to continue? {y|n}: y -
确定每个集群接口的管理或运行状态:
每个端口都应该显示出来。
Link`而且对身体有益 `Health Status。-
显示网络端口属性:
network port show -ipspace Clustercluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false -
显示有关逻辑接口及其指定归属节点的信息:
network interface show -vserver Cluster每个 LIF 都应该显示 up/up。
Status Admin/Oper`确实如此 `Is Home。
cluster1::*> network interface show -vserver Cluster Logical Status Network Current Current Is Vserver Interface Admin/Oper Address/Mask Node Port Home ----------- ----------- ---------- ------------------ ----------- ------- ---- Cluster node1_clus1 up/up 169.254.209.69/16 node1 e3a true node1_clus2 up/up 169.254.49.125/16 node1 e3b true node2_clus1 up/up 169.254.47.194/16 node2 e3a true node2_clus2 up/up 169.254.19.183/16 node2 e3b true -
-
从节点的角度来看,每个节点上的集群端口通过以下命令以如下方式连接到现有的集群交换机:
network device-discovery show -protocol lldpcluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- ---------------- ---------------- node1 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/1 - e3b c2 (6a:ad:4f:98:4c:a4) Eth1/1 - node2 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/2 - e3b c2 (6a:ad:4f:98:4c:a4) Eth1/2 - -
集群端口和交换机通过以下命令连接(从交换机的角度来看):
show cdp neighborsc1# show cdp neighbors Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge S - Switch, H - Host, I - IGMP, r - Repeater, V - VoIP-Phone, D - Remotely-Managed-Device, s - Supports-STP-Dispute Device-ID Local Intrfce Hldtme Capability Platform Port ID node1 Eth1/1 124 H AFF-A400 e3a node2 Eth1/2 124 H AFF-A400 e3a c2 Eth1/31 179 S I s N3K-C3232C Eth1/31 c2 Eth1/32 175 S I s N3K-C3232C Eth1/32 c2# show cdp neighbors Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge S - Switch, H - Host, I - IGMP, r - Repeater, V - VoIP-Phone, D - Remotely-Managed-Device, s - Supports-STP-Dispute Device-ID Local Intrfce Hldtme Capability Platform Port ID node1 Eth1/1 124 H AFF-A400 e3b node2 Eth1/2 124 H AFF-A400 e3b c1 Eth1/31 175 S I s N3K-C3232C Eth1/31 c1 Eth1/32 175 S I s N3K-C3232C Eth1/32 -
验证远程集群接口的连接性:
你可以使用 `network interface check cluster-connectivity`执行命令以启动集群连接性检查,然后显示详细信息:
network interface check cluster-connectivity start`和 `network interface check cluster-connectivity show
cluster1::*> network interface check cluster-connectivity start
*注意:*运行程序前请等待几秒钟 `show`显示详细信息的命令。
cluster1::*> network interface check cluster-connectivity show
Source Destination Packet
Node Date LIF LIF Loss
------ -------------------------- ---------------- ---------------- -----------
node1
3/5/2022 19:21:18 -06:00 node1_clus2 node2-clus1 none
3/5/2022 19:21:20 -06:00 node1_clus2 node2_clus2 none
node2
3/5/2022 19:21:18 -06:00 node2_clus2 node1_clus1 none
3/5/2022 19:21:20 -06:00 node2_clus2 node1_clus2 none
对于所有ONTAP版本,您也可以使用 `cluster ping-cluster -node <name>`检查连接性的命令:
cluster ping-cluster -node <name>
cluster1::*> cluster ping-cluster -node local
Host is node2
Getting addresses from network interface table...
Cluster node1_clus1 169.254.209.69 node1 e3a
Cluster node1_clus2 169.254.49.125 node1 e3b
Cluster node2_clus1 169.254.47.194 node2 e3a
Cluster node2_clus2 169.254.19.183 node2 e3b
Local = 169.254.47.194 169.254.19.183
Remote = 169.254.209.69 169.254.49.125
Cluster Vserver Id = 4294967293
Ping status:....
Basic connectivity succeeds on 4 path(s)
Basic connectivity fails on 0 path(s)
................
Detected 9000 byte MTU on 4 path(s):
Local 169.254.19.183 to Remote 169.254.209.69
Local 169.254.19.183 to Remote 169.254.49.125
Local 169.254.47.194 to Remote 169.254.209.69
Local 169.254.47.194 to Remote 169.254.49.125
Larger than PMTU communication succeeds on 4 path(s)
RPC status:
2 paths up, 0 paths down (tcp check)
2 paths up, 0 paths down (udp check)
-
(c2)# configure Enter configuration commands, one per line. End with CNTL/Z. (c2)(Config)# interface (c2)(config-if-range)# shutdown <interface_list> (c2)(config-if-range)# exit (c2)(Config)# exit (c2)#
-
将节点集群端口从旧交换机 c2 移至新交换机 sw2,使用NVIDIA SN2100 支持的适当线缆。
-
显示网络端口属性:
network port show -ipspace Clustercluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false -
从节点的角度来看,每个节点上的集群端口现在按以下方式连接到集群交换机:
cluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- ---------------- ---------------- node1 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/1 - e3b sw2 (b8:ce:f6:19:1a:7e) swp3 - node2 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/2 - e3b sw2 (b8:ce:f6:19:1b:96) swp4 - -
在交换机 sw2 上,确认所有节点集群端口均已启动:
net show interfacecumulus@sw2:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ---------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp15 100G 9216 BondMember sw1 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw1 (swp16) Master: cluster_isl(UP)
-
在交换机 c1 上,关闭连接到节点集群端口的端口。
(c1)# configure Enter configuration commands, one per line. End with CNTL/Z. (c1)(Config)# interface (c1)(config-if-range)# shutdown <interface_list> (c1)(config-if-range)# exit (c1)(Config)# exit (c1)#
-
将节点集群端口从旧交换机 c1 移至新交换机 sw1,使用NVIDIA SN2100 支持的适当线缆。
-
验证集群的最终配置:
network port show -ipspace Cluster每个端口都应该显示出来。
Link`而且对身体有益 `Health Status。cluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false -
从节点的角度来看,每个节点上的集群端口现在按以下方式连接到集群交换机:
cluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- -------------- ---------------- node1 /lldp e3a sw1 (b8:ce:f6:19:1a:7e) swp3 - e3b sw2 (b8:ce:f6:19:1b:96) swp3 - node2 /lldp e3a sw1 (b8:ce:f6:19:1a:7e) swp4 - e3b sw2 (b8:ce:f6:19:1b:96) swp4 - -
在交换机 sw1 和 sw2 上,验证所有节点集群端口是否都已启动:
net show interfacecumulus@sw1:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ---------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3a Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3a Master: bridge(UP) UP swp15 100G 9216 BondMember sw2 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw2 (swp16) Master: cluster_isl(UP) cumulus@sw2:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ----------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp15 100G 9216 BondMember sw1 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw1 (swp16) Master: cluster_isl(UP)
-
确认两个节点都分别与每个交换机建立了一条连接:
net show lldp以下示例显示了两种开关的正确结果:
cumulus@sw1:~$ net show lldp LocalPort Speed Mode RemoteHost RemotePort --------- ----- ---------- ------------------ ----------- swp3 100G Trunk/L2 node1 e3a swp4 100G Trunk/L2 node2 e3a swp15 100G BondMember sw2 swp15 swp16 100G BondMember sw2 swp16 cumulus@sw2:~$ net show lldp LocalPort Speed Mode RemoteHost RemotePort --------- ----- ---------- ------------------ ----------- swp3 100G Trunk/L2 node1 e3b swp4 100G Trunk/L2 node2 e3b swp15 100G BondMember sw1 swp15 swp16 100G BondMember sw1 swp16
-
启用集群 LIF 的自动回滚功能:
cluster1::*> network interface modify -vserver Cluster -lif * -auto-revert true -
确认所有集群网络 LIF 都已恢复到其原端口:
network interface showcluster1::*> network interface show -vserver Cluster Logical Status Network Current Current Is Vserver Interface Admin/Oper Address/Mask Node Port Home ----------- ---------- ---------- ------------------ ------------- ------- ---- Cluster node1_clus1 up/up 169.254.209.69/16 node1 e3a true node1_clus2 up/up 169.254.49.125/16 node1 e3b true node2_clus1 up/up 169.254.47.194/16 node2 e3a true node2_clus2 up/up 169.254.19.183/16 node2 e3b true -
启用以太网交换机健康监控日志收集功能,以收集与交换机相关的日志文件,使用以下两个命令:
system switch ethernet log setup-password`和 `system switch ethernet log enable-collection进入:
system switch ethernet log setup-passwordcluster1::*> system switch ethernet log setup-password Enter the switch name: <return> The switch name entered is not recognized. Choose from the following list: sw1 sw2 cluster1::*> system switch ethernet log setup-password Enter the switch name: sw1 RSA key fingerprint is e5:8b:c6:dc:e2:18:18:09:36:63:d9:63:dd:03:d9:cc Do you want to continue? {y|n}::[n] y Enter the password: <enter switch password> Enter the password again: <enter switch password> cluster1::*> system switch ethernet log setup-password Enter the switch name: sw2 RSA key fingerprint is 57:49:86:a1:b9:80:6a:61:9a:86:8e:3c:e3:b7:1f:b1 Do you want to continue? {y|n}:: [n] y Enter the password: <enter switch password> Enter the password again: <enter switch password>其次是:
system switch ethernet log enable-collectioncluster1::*> system switch ethernet log enable-collection Do you want to enable cluster log collection for all nodes in the cluster? {y|n}: [n] y Enabling cluster switch log collection. cluster1::*>如果这些命令中的任何一条返回错误,请联系NetApp支持。 -
启动交换机日志收集功能:
system switch ethernet log collect -device *等待 10 分钟,然后使用以下命令检查日志收集是否成功:
system switch ethernet log showcluster1::*> system switch ethernet log show Log Collection Enabled: true Index Switch Log Timestamp Status ------ ---------------------------- ------------------- --------- 1 sw1 (b8:ce:f6:19:1b:42) 4/29/2022 03:05:25 complete 2 sw2 (b8:ce:f6:19:1b:96) 4/29/2022 03:07:42 complete
-
将权限级别改回管理员:
set -privilege admin -
如果您已禁用自动创建案例功能,请通过调用AutoSupport消息重新启用该功能:
system node autosupport invoke -node * -type all -message MAINT=END