从Cisco集群交换机迁移到NVIDIA SN2100集群交换机
您可以将ONTAP 集群的无中断Cisco集群交换机迁移到NVIDIA SN2100集群交换机。在将某些旧的Cisco集群交换机更换为NVIDIA SN2100集群交换机时、您必须了解某些配置信息、端口连接和布线要求。
支持以下Cisco集群交换机:
-
Nexus 9336C-x2
-
Nexus 92300YC
-
Nexus 5596UP
-
Nexus 3232C
-
Nexus 3132Q-V
您可以将ONTAP 集群中较早的Cisco集群交换机无中断迁移到NVIDIA SN2100集群交换机。
-
现有集群必须已正确设置并正常运行。
-
所有集群端口都必须处于 up 状态,以确保无中断运行。
-
NVIDIA SN2100集群交换机必须在安装了正确版本的Cumulus Linux且应用了参考配置文件(Reference Configuration File、RCF)的情况下进行配置和运行。
-
现有集群网络配置必须满足以下要求:
-
使用两个旧版 Cisco 交换机的冗余且功能完备的 NetApp 集群。
-
管理连接以及对旧 Cisco 交换机和新交换机的控制台访问。
-
所有与集群 LIF 处于 up 状态的集群 LIF 均位于其主端口上。
-
ISL 端口已启用,并已在旧的 Cisco 交换机之间以及新交换机之间进行布线。
-
-
请参见 "Hardware Universe" 有关支持的端口及其配置的完整详细信息。
-
您已将NVIDIA SN2100交换机上的某些端口配置为以40 GbE或100 GbE运行。
-
您已规划、迁移并记录了从节点到NVIDIA SN2100集群交换机的40 GbE和100 GbE连接。
在此操作步骤 中、使用Cisco Nexus 3232C集群交换机作为示例命令和输出。
此操作步骤中的示例使用以下交换机和节点命名:
-
现有 Cisco Nexus 3232C 集群交换机为 c1 和 c2 。
-
新的NVIDIA SN2100集群交换机为_sw1_和_SW2_。
-
节点为 node1 和 node2 。
-
集群 LIF 分别是节点 1 上的 node1_clus1 和 node1_clus2 以及节点 2 上的 node2_clus1 和 node2_clus2 。
-
cluster1 ::: * >
提示符指示集群的名称。 -
此操作步骤 中使用的集群端口为_e3A_和_e3b_。
-
分支端口采用以下格式:swp[port]s[分支端口0-3]。例如、swp1上的四个分支端口为_swp1s0_、swp1s1、swp1s2_和_swp1s3。
-
交换机C2首先由交换机SW2取代、然后交换机C1由交换机sw1取代。
-
然后、节点和C2之间的布线将从C2断开、并重新连接到SW2。
-
然后、节点和C1之间的布线将从C1断开、并重新连接到sw1。
-
-
如果在此集群上启用了 AutoSupport ,请通过调用 AutoSupport 消息来禁止自动创建案例:
ssystem node AutoSupport invoke -node * -type all -message MAINT=xh
其中 x 是维护时段的持续时间,以小时为单位。
-
将权限级别更改为高级,在系统提示您继续时输入 * y* :
set -privilege advanced
此时将显示高级提示符( * > )。
-
在集群LIF上禁用自动还原:
network interface modify -vserver cluster -lif *-auto-revert false
cluster1::*> network interface modify -vserver Cluster -lif * -auto-revert false Warning: Disabling the auto-revert feature of the cluster logical interface may effect the availability of your cluster network. Are you sure you want to continue? {y|n}: y
-
确定每个集群接口的管理或运行状态:
对于
Link
,每个端口均应显示 up ;对于Health Status
,每个端口均应显示 Healthy 。-
显示网络端口属性:
network port show -ipspace cluster
cluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false
-
显示有关逻辑接口及其指定主节点的信息:
network interface show -vserver Cluster
对于
S状态管理 /Oper
,每个 LIF 都应显示 up/up ;对于,每个 LIF 都应显示 true ,而对于
,则显示 Home 。
cluster1::*> network interface show -vserver Cluster Logical Status Network Current Current Is Vserver Interface Admin/Oper Address/Mask Node Port Home ----------- ----------- ---------- ------------------ ----------- ------- ---- Cluster node1_clus1 up/up 169.254.209.69/16 node1 e3a true node1_clus2 up/up 169.254.49.125/16 node1 e3b true node2_clus1 up/up 169.254.47.194/16 node2 e3a true node2_clus2 up/up 169.254.19.183/16 node2 e3b true
-
-
从节点的角度来看、每个节点上的集群端口均使用以下命令连接到现有集群交换机:
network device-discovery show -protocol LLDP
cluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- ---------------- ---------------- node1 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/1 - e3b c2 (6a:ad:4f:98:4c:a4) Eth1/1 - node2 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/2 - e3b c2 (6a:ad:4f:98:4c:a4) Eth1/2 -
-
从交换机角度来看,集群端口和交换机使用以下命令进行连接:
show cdp neighbors
c1# show cdp neighbors Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge S - Switch, H - Host, I - IGMP, r - Repeater, V - VoIP-Phone, D - Remotely-Managed-Device, s - Supports-STP-Dispute Device-ID Local Intrfce Hldtme Capability Platform Port ID node1 Eth1/1 124 H AFF-A400 e3a node2 Eth1/2 124 H AFF-A400 e3a c2 Eth1/31 179 S I s N3K-C3232C Eth1/31 c2 Eth1/32 175 S I s N3K-C3232C Eth1/32 c2# show cdp neighbors Capability Codes: R - Router, T - Trans-Bridge, B - Source-Route-Bridge S - Switch, H - Host, I - IGMP, r - Repeater, V - VoIP-Phone, D - Remotely-Managed-Device, s - Supports-STP-Dispute Device-ID Local Intrfce Hldtme Capability Platform Port ID node1 Eth1/1 124 H AFF-A400 e3b node2 Eth1/2 124 H AFF-A400 e3b c1 Eth1/31 175 S I s N3K-C3232C Eth1/31 c1 Eth1/32 175 S I s N3K-C3232C Eth1/32
-
使用命令验证集群网络是否已完全连接:
cluster ping-cluster -node node-name
cluster1::*> cluster ping-cluster -node node2 Host is node2 Getting addresses from network interface table... Cluster node1_clus1 169.254.209.69 node1 e3a Cluster node1_clus2 169.254.49.125 node1 e3b Cluster node2_clus1 169.254.47.194 node2 e3a Cluster node2_clus2 169.254.19.183 node2 e3b Local = 169.254.47.194 169.254.19.183 Remote = 169.254.209.69 169.254.49.125 Cluster Vserver Id = 4294967293 Ping status: .... Basic connectivity succeeds on 4 path(s) Basic connectivity fails on 0 path(s) ................ Detected 9000 byte MTU on 4 path(s): Local 169.254.19.183 to Remote 169.254.209.69 Local 169.254.19.183 to Remote 169.254.49.125 Local 169.254.47.194 to Remote 169.254.209.69 Local 169.254.47.194 to Remote 169.254.49.125 Larger than PMTU communication succeeds on 4 path(s) RPC status: 2 paths up, 0 paths down (tcp check) 2 paths up, 0 paths down (udp check)
-
在交换机C2上、关闭连接到节点集群端口的端口。
(c2)# configure Enter configuration commands, one per line. End with CNTL/Z. (c2)(Config)# interface (c2)(config-if-range)# shutdown <interface_list> (c2)(config-if-range)# exit (c2)(Config)# exit (c2)#
-
使用NVIDIA SN2100支持的适当布线将节点集群端口从旧交换机C2移至新交换机SW2。
-
显示网络端口属性:
network port show -ipspace cluster
cluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false
-
现在,从节点的角度来看,每个节点上的集群端口均以以下方式连接到集群交换机:
cluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- ---------------- ---------------- node1 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/1 - e3b sw2 (b8:ce:f6:19:1a:7e) swp3 - node2 /lldp e3a c1 (6a:ad:4f:98:3b:3f) Eth1/2 - e3b sw2 (b8:ce:f6:19:1b:96) swp4 -
-
在交换机SW2上、验证所有节点集群端口是否均已启动:
net show interface
cumulus@sw2:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ---------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp15 100G 9216 BondMember sw1 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw1 (swp16) Master: cluster_isl(UP)
-
在交换机C1上、关闭连接到节点集群端口的端口。
(c1)# configure Enter configuration commands, one per line. End with CNTL/Z. (c1)(Config)# interface (c1)(config-if-range)# shutdown <interface_list> (c1)(config-if-range)# exit (c1)(Config)# exit (c1)#
-
使用NVIDIA SN2100支持的适当布线将节点集群端口从旧交换机C1移至新交换机sw1。
-
验证集群的最终配置:
network port show -ipspace cluster
对于
Link
,每个端口均应显示 up ;对于Health Status
,每个端口均应显示 Healthy 。cluster1::*> network port show -ipspace Cluster Node: node1 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false Node: node2 Ignore Speed(Mbps) Health Health Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status --------- ---------- ---------------- ---- ----- ------------ -------- ------ e3a Cluster Cluster up 9000 auto/100000 healthy false e3b Cluster Cluster up 9000 auto/100000 healthy false
-
现在,从节点的角度来看,每个节点上的集群端口均以以下方式连接到集群交换机:
cluster1::*> network device-discovery show -protocol lldp Node/ Local Discovered Protocol Port Device (LLDP: ChassisID) Interface Platform ----------- ------ ------------------------- -------------- ---------------- node1 /lldp e3a sw1 (b8:ce:f6:19:1a:7e) swp3 - e3b sw2 (b8:ce:f6:19:1b:96) swp3 - node2 /lldp e3a sw1 (b8:ce:f6:19:1a:7e) swp4 - e3b sw2 (b8:ce:f6:19:1b:96) swp4 -
-
在交换机sw1和SW2上、验证所有节点集群端口是否均已启动:
net show interface
cumulus@sw1:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ---------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3a Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3a Master: bridge(UP) UP swp15 100G 9216 BondMember sw2 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw2 (swp16) Master: cluster_isl(UP) cumulus@sw2:~$ net show interface State Name Spd MTU Mode LLDP Summary ----- ----------- ---- ----- ---------- ----------------- ----------------------- ... ... UP swp3 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp4 100G 9216 Trunk/L2 e3b Master: bridge(UP) UP swp15 100G 9216 BondMember sw1 (swp15) Master: cluster_isl(UP) UP swp16 100G 9216 BondMember sw1 (swp16) Master: cluster_isl(UP)
-
验证两个节点的每个交换机是否都有一个连接:
net show lldp
以下示例显示了这两个交换机的相应结果:
cumulus@sw1:~$ net show lldp LocalPort Speed Mode RemoteHost RemotePort --------- ----- ---------- ------------------ ----------- swp3 100G Trunk/L2 node1 e3a swp4 100G Trunk/L2 node2 e3a swp15 100G BondMember sw2 swp15 swp16 100G BondMember sw2 swp16 cumulus@sw2:~$ net show lldp LocalPort Speed Mode RemoteHost RemotePort --------- ----- ---------- ------------------ ----------- swp3 100G Trunk/L2 node1 e3b swp4 100G Trunk/L2 node2 e3b swp15 100G BondMember sw1 swp15 swp16 100G BondMember sw1 swp16
-
在集群LIF上启用自动还原:
cluster1:::*> network interface modify -vserver cluster -lif *-auto-revert true
-
验证所有集群网络LIF是否均已返回其主端口:
network interface show
cluster1::*> network interface show -vserver Cluster Logical Status Network Current Current Is Vserver Interface Admin/Oper Address/Mask Node Port Home ----------- ---------- ---------- ------------------ ------------- ------- ---- Cluster node1_clus1 up/up 169.254.209.69/16 node1 e3a true node1_clus2 up/up 169.254.49.125/16 node1 e3b true node2_clus1 up/up 169.254.47.194/16 node2 e3a true node2_clus2 up/up 169.254.19.183/16 node2 e3b true
-
使用以下两个命令启用以太网交换机运行状况监控器日志收集功能以收集交换机相关的日志文件:
ssystem switch Ethernet log setup-password`和`ssystem switch Ethernet log enable-Collection
输入:
ssystem switch Ethernet log setup-password
cluster1::*> system switch ethernet log setup-password Enter the switch name: <return> The switch name entered is not recognized. Choose from the following list: sw1 sw2 cluster1::*> system switch ethernet log setup-password Enter the switch name: sw1 RSA key fingerprint is e5:8b:c6:dc:e2:18:18:09:36:63:d9:63:dd:03:d9:cc Do you want to continue? {y|n}::[n] y Enter the password: <enter switch password> Enter the password again: <enter switch password> cluster1::*> system switch ethernet log setup-password Enter the switch name: sw2 RSA key fingerprint is 57:49:86:a1:b9:80:6a:61:9a:86:8e:3c:e3:b7:1f:b1 Do you want to continue? {y|n}:: [n] y Enter the password: <enter switch password> Enter the password again: <enter switch password>
后跟:
ssystem switch Ethernet log enable-Collection
cluster1::*> system switch ethernet log enable-collection Do you want to enable cluster log collection for all nodes in the cluster? {y|n}: [n] y Enabling cluster switch log collection. cluster1::*>
如果其中任何一个命令返回错误,请联系 NetApp 支持部门。 -
启动交换机日志收集功能:
ssystem switch Ethernet log Collect -device *
等待10分钟、然后使用命令`ssystem switch Ethernet log show`检查日志收集是否成功
cluster1::*> system switch ethernet log show Log Collection Enabled: true Index Switch Log Timestamp Status ------ ---------------------------- ------------------- --------- 1 sw1 (b8:ce:f6:19:1b:42) 4/29/2022 03:05:25 complete 2 sw2 (b8:ce:f6:19:1b:96) 4/29/2022 03:07:42 complete
-
将权限级别重新更改为 admin :
set -privilege admin
-
如果禁止自动创建案例,请通过调用 AutoSupport 消息重新启用该功能:
ssystem node AutoSupport invoke -node * -type all -message MAINT=end