Use Oracle Linux 7.6 with ONTAP
You can use the ONTAP SAN host configuration settings to configure Oracle Linux 7.6 with ONTAP as the target.
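Install the Linux Unified Host Utilities
The NetApp Linux Unified Host Utilities software package is available on the "NetApp Support Site" as 32-bit and 64-bit .rpm files. If you do not know which file is right for your configuration, use the "NetApp Interoperability Matrix Tool" to verify which one you need.
NetApp strongly recommends installing the Linux Unified Host Utilities, but it is not mandatory. The utilities do not change any settings on your Linux host. They improve management and help NetApp customer support gather information about your configuration.
If you currently have the Linux Unified Host Utilities installed, either upgrade to the latest version, or remove the existing version and follow these steps to install the latest version.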
1. Download the 32-bit or 64-bit Linux Unified Host Utilities software package from the "NetApp Support Site" to your host.
2. Install the software package:
rpm -ivh netapp_linux_unified_host_utilities-7-1.x86_64
NOTE: You can use the configuration settings provided in this document to configure cloud clients connected to "Cloud Volumes ONTAP" and "Amazon FSx for ONTAP".
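SAN toolkit
The toolkit is installed automatically when you install the NetApp Host Utilities package. This kit provides the sanlun utility, which helps you manage LUNs and HBAs. The sanlun command returns information about the LUNs mapped to your host, multipathing, and the information needed to create initiator groups.
In the following example, the sanlun lun show command returns LUN information.
# sanlun lun show all
Example output: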
controller(7mode/E-Series)/                  device     host               lun
vserver(cDOT/FlashRay)      lun-pathname     filename   adapter  protocol  size    Product
------------------------------------------------------------------------------------------
data_vserver                /vol/vol1/lun1   /dev/sdb   host16   FCP       120.0g  cDOT
data_vserver                /vol/vol1/lun1   /dev/sdc   host15   FCP       120.0g  cDOT
data_vserver                /vol/vol2/lun2   /dev/sdd   host16   FCP       120.0g  cDOT
data_vserver                /vol/vol2/lun2   /dev/sde   host15   FCP       120.0g  cDOT
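The sample report lists each LUN once per host device, so counting rows per LUN pathname gives a quick view of how many paths the host sees. The following is a minimal sketch, not part of the Host Utilities: the function name `count_lun_paths` is hypothetical, and it assumes that data rows carry a LUN pathname beginning with `/vol/` in the second column, as in the sample output.

```shell
# Count host devices (paths) per LUN pathname in `sanlun lun show` output.
# Reads the report on standard input. Assumption: data rows have the LUN
# pathname (starting with /vol/) in column 2, as in the sample output.
count_lun_paths() {
    awk '$2 ~ /^\/vol\// { n[$2]++ }
         END { for (p in n) print p, n[p] }' | sort
}

# Typical use on a host with the Host Utilities installed:
#   sanlun lun show all | count_lun_paths
```

Header and separator lines are skipped because their second field does not start with `/vol/`.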
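SAN booting
If you decide to use SAN booting, it must be supported by your configuration. You can use the "NetApp Interoperability Matrix Tool" to verify that your OS, HBA, HBA firmware, HBA boot BIOS, and ONTAP version are supported.
1. Map the SAN boot LUN to the host.
2. Verify that multiple paths are available.
NOTE: Multiple paths become available after the host OS is up and running on the paths.
3. Enable SAN booting in the server BIOS for the ports to which the SAN boot LUN is mapped.
For information about how to enable the HBA BIOS, see your vendor-specific documentation.
4. Reboot the host to verify that the boot is successful.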
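Multipathing
For Oracle Linux 7.6, the /etc/multipath.conf file must exist, but you do not need to make specific changes to the file. Oracle Linux 7.6 is compiled with all the settings required to recognize and correctly manage ONTAP LUNs.
You can use the `multipath -ll` command to verify the settings for your ONTAP LUNs. There should be two groups of paths with different priorities. The higher-priority paths are Active/Optimized, meaning that they are serviced by the controller where the aggregate is located. The lower-priority paths are active but non-optimized because they are served from a different controller. The non-optimized paths are used only when no optimized paths are available.
The following example shows the correct output for an ONTAP LUN with two Active/Optimized paths and two Active/Non-Optimized paths: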
# multipath -ll
3600a09803831347657244e527766394e dm-5 NETAPP,LUN C-Mode
size=80G features='4 queue_if_no_path pg_init_retries 50 retain_attached_hw_handler' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| |- 11:0:1:0 sdj 8:144 active ready running
| |- 11:0:2:0 sdr 65:16 active ready running
|-+- policy='service-time 0' prio=10 status=enabled
|- 11:0:0:0 sdb 8:16 active ready running
|- 12:0:0:0 sdz 65:144 active ready running
NOTE: A single LUN should not require more than four paths. Having more than four paths might cause path issues during storage failures.
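The two checks described above (two priority groups, and no more than four paths per LUN) can be scripted against the `multipath -ll` output. The following is a rough sketch under stated assumptions: the function name `summarize_paths` is hypothetical, the input is the output for a single LUN, and group and path lines look like those in the example.

```shell
# Summarize `multipath -ll` output for one LUN, read from standard input:
# print the number of paths in each priority group, then the total, and
# return non-zero when the total exceeds four paths.
summarize_paths() {
    awk '
        /prio=[0-9]+/ {
            # Start of a path group: remember its priority value.
            match($0, /prio=[0-9]+/)
            prio = substr($0, RSTART + 5, RLENGTH - 5)
            order[++groups] = prio
            next
        }
        /[0-9]+:[0-9]+:[0-9]+:[0-9]+ sd[a-z]+/ {
            # A path line: host:channel:id:lun followed by the sd device.
            count[prio]++
            total++
        }
        END {
            for (i = 1; i <= groups; i++)
                print "prio=" order[i], "paths=" count[order[i]] + 0
            print "total=" total + 0
            exit (total > 4)
        }'
}
```

For the example output, this prints one line for the `prio=50` group, one for the `prio=10` group, and a total of four paths.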
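Recommended settings
The Oracle Linux 7.6 OS is compiled to recognize ONTAP LUNs and automatically set all configuration parameters correctly. The `multipath.conf` file must exist for the multipath daemon to start. If this file does not exist, you can create an empty, zero-byte file by using the `touch /etc/multipath.conf` command.
The first time you create the `multipath.conf` file, you might need to enable and start the multipath services by using the following commands:
chkconfig multipathd on
/etc/init.d/multipathd start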
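If you script this step, it is worth making it safe to repeat so that an existing configuration is never touched. The following is a minimal sketch; the function name `ensure_multipath_conf` and its optional path argument are assumptions added for illustration and testing, not part of the documented procedure.

```shell
# Create an empty multipath configuration file only if none exists yet.
# The path defaults to /etc/multipath.conf; the optional argument is a
# convenience for trying the function against a scratch location.
ensure_multipath_conf() {
    conf=${1:-/etc/multipath.conf}
    if [ -e "$conf" ]; then
        echo "exists: $conf"
    else
        touch "$conf" && echo "created: $conf"
    fi
}
```

Run it as root before enabling and starting the multipath service with the commands shown above.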
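You do not need to add anything directly to the `multipath.conf` file unless you have devices that you do not want multipath to manage or you have existing settings that override the defaults. To exclude unwanted devices, add the following syntax to the `multipath.conf` file, replacing <DevId> with the worldwide identifier (WWID) string of the device that you want to exclude:
blacklist {
        wwid <DevId>
        devnode "^(ram|raw|loop|fd|md|dm-|sr|scd|st)[0-9]*"
        devnode "^hd[a-z]"
        devnode "^cciss.*"
}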
The following example determines the WWID of a device and adds it to the `multipath.conf` file.
1. Determine the WWID:
/lib/udev/scsi_id -gud /dev/sda
3600a098038314c4a433f5774717a3046
`sda` is the local SCSI disk that you want to add to the blacklist.
2. Add the WWID to the blacklist stanza in `/etc/multipath.conf`:
blacklist {
        wwid 3600a098038314c4a433f5774717a3046
        devnode "^(ram|raw|loop|fd|md|dm-|sr|scd|st)[0-9]*"
        devnode "^hd[a-z]"
        devnode "^cciss.*"
}
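The two steps above can be combined into a small helper that turns a WWID into a ready-to-paste stanza. This is a sketch only: the function name `blacklist_stanza` is hypothetical, the devnode patterns are copied from the syntax shown earlier, and the function prints the stanza rather than editing `/etc/multipath.conf`, so you can review it first.

```shell
# Print a blacklist stanza for the given WWID, reusing the devnode patterns
# from the syntax shown earlier in this section.
blacklist_stanza() {
    wwid=$1
    # Reject an empty value or one that contains a space.
    case $wwid in
        ''|*' '*) echo "invalid WWID: $wwid" >&2; return 1 ;;
    esac
    cat <<EOF
blacklist {
        wwid $wwid
        devnode "^(ram|raw|loop|fd|md|dm-|sr|scd|st)[0-9]*"
        devnode "^hd[a-z]"
        devnode "^cciss.*"
}
EOF
}

# Possible use, feeding in the identifier reported for the local disk:
#   blacklist_stanza "$(/lib/udev/scsi_id -gud /dev/sda)"
```

If the file already contains a blacklist section, merge the `wwid` line into it instead of appending a second stanza.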
Always check your `/etc/multipath.conf` file, especially in the defaults section, for legacy settings that might be overriding default settings.

The following table lists the critical `multipathd` parameters for ONTAP LUNs and the required values. If a host is connected to LUNs from other vendors and any of these parameters are overridden, they must be corrected by later stanzas in the `multipath.conf` file that apply specifically to ONTAP LUNs. Without this correction, the ONTAP LUNs might not work as expected. You should only override these defaults in consultation with NetApp, the OS vendor, or both, and only when the impact is fully understood.

[cols=2*,options="header"]
|===
| Parameter | Setting
| detect_prio | yes
| dev_loss_tmo | "infinity"
| failback | immediate
| fast_io_fail_tmo | 5
| features | "3 queue_if_no_path pg_init_retries 50"
| flush_on_last_del | "yes"
| hardware_handler | "0"
| path_checker | "tur"
| path_grouping_policy | "group_by_prio"
| path_selector | "service-time 0"
| polling_interval | 5
| prio | "ontap"
| product | LUN.*
| retain_attached_hw_handler | yes
| rr_weight | "uniform"
| user_friendly_names | no
| vendor | NETAPP
|===

.Example
The following example shows how to correct an overridden default. In this case, the `multipath.conf` file defines values for `path_checker` and `detect_prio` that are not compatible with ONTAP LUNs. If they cannot be removed because other SAN arrays are still attached to the host, these parameters can be corrected specifically for ONTAP LUNs with a device stanza.
defaults {
   path_checker      readsector0
   detect_prio       no
}
devices {
   device {
      vendor         "NETAPP"
      product        "LUN.*"
      path_checker   tur
      detect_prio    yes
   }
}
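Legacy overrides in the defaults section are easy to miss by eye. The following is a minimal sketch of a check, not a full `multipath.conf` parser: the function name `check_defaults` is hypothetical, only four of the parameters from the table are compared, and it assumes one setting per line with the section braces laid out as in the example.

```shell
# Report settings in the defaults section of a multipath.conf-style file that
# differ from the values listed in the parameter table. Only a subset of the
# parameters is checked (an assumption made to keep the sketch short).
check_defaults() {
    awk '
        BEGIN {
            want["path_checker"] = "tur"
            want["detect_prio"] = "yes"
            want["failback"] = "immediate"
            want["user_friendly_names"] = "no"
        }
        /^[ \t]*defaults[ \t]*[{]/ { in_defaults = 1; next }
        in_defaults && /[}]/        { in_defaults = 0; next }
        in_defaults && ($1 in want) {
            value = $2
            gsub(/"/, "", value)    # tolerate quoted values
            if (value != want[$1])
                print $1 ": found " value ", expected " want[$1]
        }' "$1"
}

# Possible use:
#   check_defaults /etc/multipath.conf
```

Anything it reports is a candidate either for removal or for a correcting device stanza like the one in the example.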
NOTE: To configure Oracle Linux 7.6 Red Hat Compatible Kernel (RHCK), use the link:hu_rhel_76.html#recommended-settings[recommended settings] for Red Hat Enterprise Linux (RHEL) 7.6.

== Known issues

The Oracle Linux 7.6 with ONTAP release has the following known issues:

[cols=3*,options="header"]
|===
| NetApp Bug ID | Title | Description

| 1440718
| If you unmap or map a LUN without performing a SCSI rescan, it might lead to data corruption on the host.
| When you set the `disable_changed_wwids` multipath configuration parameter to YES, it disables access to the path device in the event of a WWID change. Multipath will disable access to the path device until the WWID of the path is restored to the WWID of the multipath device. To learn more, see link:https://kb.netapp.com/Advice_and_Troubleshooting/Flash_Storage/AFF_Series/The_filesystem_corruption_on_iSCSI_LUN_on_the_Oracle_Linux_7[NetApp Knowledge Base: The filesystem corruption on iSCSI LUN on the Oracle Linux 7^].

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1202736[1202736^]
| LUNs might not be available during host discovery due to "Not Present" state of remote ports on a OL7U6 host with QLogic QLE2742 adapter
| During host discovery, the status of Fibre Channel (FC) remote ports on an OL7U6 host with a QLogic QLE2742 adapter might enter a "Not Present" state. Remote ports with a "Not Present" state might cause paths to LUNs to become unavailable. During storage failover, the path redundancy might be reduced and result in an I/O outage.

You can check the remote port status by entering the following command:

`# cat /sys/class/fc_remote_ports/rport-*/port_state`

The following is an example of the output that is displayed:

`Online` +
`Online` +
`Not Present` +
`Online` +
`Online`

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1204078[1204078^]
| Kernel disruption occurs on Oracle Linux 7.6 running with Qlogic(QLE2672) 16GB FC HBA during storage failover operations
| During storage failover operations on Oracle Linux 7.6 with a QLogic QLE2672 Fibre Channel (FC) host bus adapter (HBA), a kernel disruption occurs due to a panic in the kernel. The kernel panic causes Oracle Linux 7.6 to reboot, which leads to an application disruption. If the kdump mechanism is enabled, the kernel panic generates a vmcore file located in the /var/crash/ directory. You can analyze the vmcore file to determine the cause of the panic. After the kernel disruption, you can reboot the host OS and recover the operating system, and then you can restart any applications as required.

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1204351[1204351^]
| Kernel disruption might occur on Oracle Linux 7.6 running with Qlogic(QLE2742) 32GB FC HBA during storage failover operations
| During storage failover operations on Oracle Linux 7.6 with a QLogic QLE2742 Fibre Channel (FC) host bus adapter (HBA), a kernel disruption might occur due to a panic in the kernel. The kernel panic causes Oracle Linux 7.6 to reboot, which leads to an application disruption. If the kdump mechanism is enabled, the kernel panic generates a vmcore file located in the /var/crash/ directory. You can analyze the vmcore file to determine the cause of the panic. After the kernel disruption, you can reboot the host OS and recover the operating system, and then you can restart any applications as required.

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1204352[1204352^]
| Kernel disruption might occur on Oracle Linux 7.6 running with Emulex (LPe32002-M2)32GB FC HBA during storage failover operations
| During storage failover operations on Oracle Linux 7.6 with an Emulex LPe32002-M2 Fibre Channel (FC) host bus adapter (HBA), a kernel disruption might occur due to a panic in the kernel. The kernel panic causes Oracle Linux 7.6 to reboot, which leads to an application disruption. If the kdump mechanism is enabled, the kernel panic generates a vmcore file located in the /var/crash/ directory. You can analyze the vmcore file to determine the cause of the panic. After the kernel disruption, you can reboot the host OS and recover the operating system, and then you can restart any applications as required.

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1246134[1246134^]
| No I/O progress on Oracle Linux 7.6 with UEK5U2 kernel, running with an Emulex LPe16002B-M6 16G FC HBA during storage failover operations
| During storage failover operations on Oracle Linux 7.6 with the UEK5U2 kernel running with an Emulex LPe16002B-M6 16G Fibre Channel (FC) host bus adapter (HBA), I/O progress might stop because remote ports become blocked. During the storage failover operation, the remote ports change from an "online" state to a "blocked" state, causing a delay in read and write operations. After the operation has completed successfully, the remote ports fail to move back to an "online" state and remain in a "blocked" state.

| link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1246327[1246327^]
| Remote port status on QLogic QLE2672 16G host blocked during storage failover operations
| Fibre Channel (FC) remote ports might be blocked on Red Hat Enterprise Linux (RHEL) 7.6 with the QLogic QLE2672 16G host during storage failover operations. Because the logical interfaces go down when a storage node is down, the remote ports set the storage node status to blocked. I/O progress might stop due to the blocked ports if you are running both a QLogic QLE2672 16G host and a QLE2742 32GB Fibre Channel (FC) host bus adapter (HBA). When the storage node returns to its optimal state, the logical interfaces also come up and the remote ports should be online. However, the remote ports might still be blocked. This blocked state registers as failed faulty for the LUNs at the multipath layer.

You can verify the state of the remote ports with the following command:

`# cat /sys/class/fc_remote_ports/rport-*/port_state`

You should see the following output:

`Blocked` +
`Blocked` +
`Blocked` +
`Blocked` +
`Online` +
`Online`
|===
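Several of the known issues are diagnosed by reading the remote port states, so it can help to summarize that output instead of scanning it by eye. The following is a minimal sketch; the function name `rport_summary` is hypothetical, and it assumes one state per line, as printed by the `port_state` files referenced in the table.

```shell
# Summarize Fibre Channel remote port states read from standard input, one
# state per line. Lists every state other than "Online", prints the counts,
# and returns non-zero if any port is not online.
rport_summary() {
    awk '
        NF == 0 { next }
        $0 == "Online" { online++; next }
        { other++; print "not online: " $0 }
        END {
            print "online=" online + 0, "other=" other + 0
            exit (other > 0)
        }'
}

# Possible use on the host:
#   cat /sys/class/fc_remote_ports/rport-*/port_state | rport_summary
```

The non-zero return status makes the check usable in a monitoring script after a storage failover.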