使用 SUSE Linux Enterprise Server 12 SP3 搭配 ONTAP
您可以使用 ONTAP SAN 主機組態設定、將 SUSE Linux Enterprise Server 12 SP3 與 ONTAP 設定為目標。
安裝Linux Unified Host Utilities
NetApp Linux Unified Host Utilities 軟體套件可在 32 位元和 64 位元的 .rpm 檔案中找"NetApp 支援網站"到。如果您不知道哪一個檔案適合您的組態,請使用來驗證您需要的檔案"NetApp 互通性對照表工具"。
NetApp 強烈建議您安裝 Linux 統一化主機公用程式,但並非強制安裝。這些公用程式不會變更 Linux 主機上的任何設定。這些公用程式可改善管理、並協助NetApp客戶支援部門收集您的組態相關資訊。
如果您目前已安裝 Linux Unified Host Utilities ,您應該將其升級至最新版本,或是將其移除,然後依照下列步驟安裝最新版本。
-
從下載32位元或64位元Linux Unified Host Utilities軟體套件 "NetApp 支援網站" 到您的主機。
-
安裝軟體套件:
「rpm -ivh netapp_Linux統一化_host_utilities - 7-1.x86_64」
您可以使用本文所提供的組態設定來設定連線至的雲端用戶端 "Cloud Volumes ONTAP" 和 "Amazon FSX for ONTAP Sf"。 |
SAN工具套件
當您安裝NetApp主機公用程式套件時、會自動安裝此工具套件。此套件提供「資源」公用程式、可協助您管理LUN和HBA。「lanlun」命令會傳回對應至主機的LUN資訊、多重路徑、以及建立啟動器群組所需的資訊。
在以下範例中、「左LUN show」命令會傳回LUN資訊。
# sanlun lun show all
輸出範例:
controller(7mode/E-Series)/ device host lun vserver(cDOT/FlashRay) lun-pathname filename adapter protocol size Product ------------------------------------------------------------------------------------ data_vserver /vol/vol1/lun1 /dev/sdb host16 FCP 120.0g cDOT data_vserver /vol/vol1/lun1 /dev/sdc host15 FCP 120.0g cDOT data_vserver /vol/vol2/lun2 /dev/sdd host16 FCP 120.0g cDOT data_vserver /vol/vol2/lun2 /dev/sde host15 FCP 120.0g cDOT
SAN開機
如果您決定使用SAN開機、則組態必須支援SAN開機。您可以使用 "NetApp 互通性對照表工具" 驗證是否ONTAP 支援您的作業系統、HBA、HBA韌體和HBA開機BIOS及BIOS版本。
-
將SAN開機LUN對應至主機。
-
確認有多個路徑可供使用。
在主機作業系統啟動並在路徑上執行之後,就可以使用多個路徑。 -
在伺服器BIOS中為SAN開機LUN對應的連接埠啟用SAN開機。
如需如何啟用HBA BIOS的相關資訊、請參閱廠商專屬的文件。
-
重新啟動主機、確認開機成功。
多重路徑
對於SUSE Linux Enterprise Server 12 SP3、/etc/multipath.conf檔案必須存在、但您不需要對檔案進行特定變更。SUSE Linux Enterprise Server 12 SP3的所有設定都已經過編譯、可辨識及正確管理ONTAP 各種LUN。
您可以使用「multiPath -ll」命令來驗證ONTAP 您的各個LUN的設定。
下列各節提供對應至 ASA 和非 ASA 角色之 LUN 的多重路徑輸出範例。
所有 SAN 陣列組態
所有 SAN 陣列( ASA )組態都會最佳化通往指定 LUN 的所有路徑,使其保持作用中。如此可同時透過所有路徑提供 I/O 作業、進而提升效能。
以下範例顯示 ONTAP LUN 的正確輸出。
# multipath -ll 3600a0980383034466b2b4a3775474859 dm-3 NETAPP,LUN C-Mode size=20G features='4 queue_if_no_path pg_init_retries 50 retain_attached_hw_handle' hwhandler='1 alua' wp=rw |-+- policy='round-robin 0' prio=50 status=active | |- 1:0:8:1 sdb 8:16 active ready running | `- 2:0:8:1 sdd 8:48 active ready running `-+- policy='round-robin 0' prio=10 status=enabled |- 1:0:9:1 sdc 8:32 active ready running `- 2:0:9:1 sde 8:64 active ready running
單一 LUN 不應需要四個以上的路徑。如果路徑超過四條,可能會在儲存設備故障期間造成路徑問題。 |
非 ASA 組態
對於非 ASA 組態、應該有兩個路徑群組、其優先順序不同。優先順序較高的路徑為主動 / 最佳化、表示它們由集合所在的控制器提供服務。優先順序較低的路徑是作用中的、但未最佳化、因為它們是由不同的控制器提供服務。非最佳化路徑只有在最佳化路徑無法使用時才會使用。
下列範例顯示ONTAP 使用兩個主動/最佳化路徑和兩個主動/非最佳化路徑的正確輸出。
# multipath -ll 3600a09803831347657244e527766394e dm-5 NETAPP,LUN C-Mode size=80G features='4 queue_if_no_path pg_init_retries 50 retain_attached_hw_handler' hwhandler='1 alua' wp=rw |-+- policy='service-time 0' prio=50 status=active | |- 11:0:1:0 sdj 8:144 active ready running | |- 11:0:2:0 sdr 65:16 active ready running `-+- policy='service-time 0' prio=10 status=enabled |- 11:0:0:0 sdb 8:i6 active ready running |- 12:0:0:0 sdz 65:144 active ready running
單一 LUN 不應需要四個以上的路徑。如果路徑超過四條,可能會在儲存設備故障期間造成路徑問題。 |
建議設定
SUSE Linux Enterprise Server 12 SP3 作業系統的編譯是為了識別 ONTAP LUN 、並自動正確設定所有組態參數。該 multipath.conf`檔案必須存在、多重路徑常駐程式才能啟動。如果此檔案不存在,您可以使用命令建立空白的零位元組檔案 `touch /etc/multipath.conf
。
第一次建立 `multipath.conf`檔案時、您可能需要使用下列命令來啟用和啟動多重路徑服務:
chkconfig multipathd on /etc/init.d/multipathd start
您不需要直接將任何內容新增至 `multipath.conf`檔案,除非您有不想要多重路徑管理的裝置,或現有的設定會覆寫預設值。若要排除不想要的裝置,請將下列語法新增至 `multipath.conf`檔案,以您要排除的裝置的全球識別碼( WWID )字串取代 <DevId> :
blacklist { wwid <DevId> devnode "^(ram|raw|loop|fd|md|dm-|sr|scd|st)[0-9]*" devnode "^hd[a-z]" devnode "^cciss.*" }
以下範例決定裝置的 WWID 、並將其新增至 `multipath.conf`檔案。
-
判斷 WWID :
/lib/udev/scsi_id -gud /dev/sda
3600a098038314c4a433f5774717a3046 /lib/udev/scsi_id -gud /dev/sda
360030057024d0730239134810c0cb833
+ `sda` is the local SCSI disk that you want to add to the blacklist. . Add the `WWID` to the blacklist stanza in `/etc/multipath.conf`: [source,cli] +
黑名單 { WWID 3600a098038314c4a433f5774717a3046 devnode " ^( RAM|RAW|REW|FD|MD|dm-|SR|SCD|st)" devnode "^HD[a-z]" devnode "^ccis."}
Always check your `/etc/multipath.conf` file, especially in the defaults section, for legacy settings that might be overriding default settings. The following table demonstrates the critical `multipathd` parameters for ONTAP LUNs and the required values. If a host is connected to LUNs from other vendors and any of these parameters are overridden, they must be corrected by later stanzas in the `multipath.conf` file that apply specifically to ONTAP LUNs. Without this correction, the ONTAP LUNs might not work as expected. You should only override these defaults in consultation with NetApp, the OS vendor, or both, and only when the impact is fully understood. //ONTAPDOC-2578 9-Dec-2024 //ONTAPDOC-2561 25-Nov-202 [cols=2*,options="header"] |=== | Parameter | Setting | detect_prio | yes | dev_loss_tmo | "infinity" | failback | immediate | fast_io_fail_tmo | 5 | features | "2 pg_init_retries 50" | flush_on_last_del | "yes" | hardware_handler | "0" | no_path_retry | queue | path_checker | "tur" | path_grouping_policy | "group_by_prio" | path_selector | "service-time 0" | polling_interval | 5 | prio | "ontap" | product | LUN.* | retain_attached_hw_handler | yes | rr_weight | "uniform" | user_friendly_names | no | vendor | NETAPP |=== .Example The following example shows how to correct an overridden default. In this case, the `multipath.conf` file defines values for `path_checker` and `no_path_retry` that are not compatible with ONTAP LUNs. If they cannot be removed because of other SAN arrays still attached to the host, these parameters can be corrected specifically for ONTAP LUNs with a device stanza.
預設 { path_checker_readsector0 no_path_retry fail } 裝置 { 裝置 { 廠商「 NetApp 」產品 "lun.*" no_path_retry queue path_checkur } }
== Known issues The SUSE Linux Enterprise Server 15 SP3 with ONTAP release has the following known issues: [cols=3*,options="header"] |=== | NetApp Bug ID | Title | Description | link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1089555[1089555^] | Kernel disruption observed on kernel version SLES12 SP3 with Emulex LPe16002 16GB FC during storage failover operation | A kernel disruption might occur during storage failover operations on kernel version SLES12 SP3 with Emulex LPe16002 HBA. The kernel disruption prompts a reboot of the operating system, which in turn causes an application disruption. If the kdump is configured, the kernel disruption generates a vmcore file under /var/crash/directory. You can investigate the cause of the failure in the vmcore file. Example: In the observed case, the kernel disruption was observed in the module “lpfc_sli_ringtxcmpl_put+51” and is logged in the vmcore file – exception RIP: lpfc_sli_ringtxcmpl_put+51. Recover the operating system after the kernel disruption by rebooting the host operating system and restarting the application. | link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1089561[1089561^] | Kernel disruption observed on kernel version SLES12 SP3 with Emulex LPe32002 32GB FC during storage failover operations | A kernel disruption might occur during storage failover operations on kernel version SLES12 SP3 with Emulex LPe32002 HBA. The kernel disruption prompts a reboot of the operating system, which in turn causes an application disruption. If the kdump is configured, the kernel disruption generates a vmcore file under /var/crash/directory. You can investigate the cause of the failure in the vmcore file. Example: In the observed case, the kernel disruption was observed in the module “lpfc_sli_free_hbq+76” and is logged in the vmcore file – exception RIP: lpfc_sli_free_hbq+76. Recover the operating system after the kernel disruption by rebooting the host operating system and restarting the application. | link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1117248[1117248^] | Kernel disruption observed on SLES12SP3 with QLogic QLE2562 8GB FC during storage failover operations | During storage failover operations on the Sles12sp3 kernel (kernel-default-4.4.82-6.3.1) with QLogic QLE2562 HBA, the kernel disruption was observed due to a panic in the kernel. The kernel panic leads to a reboot of the operating system, causing an application disruption. The kernel panic generates the vmcore file under the /var/crash/ directory if kdump is configured. Upon the kernel panic, the vmcore file can be used to understand the cause of the failure. Example: In this case, the panic was observed in the “blk_finish_request+289” module. It is logged in the vmcore file with the following string: "exception RIP: blk_finish_request+289" After the kernel disruption, you can recover the operating system by rebooting the Host OS. You can restart the application as required. | link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1117261[1117261^] | Kernel disruption observed on SLES12SP3 with Qlogic QLE2662 16GB FC during storage failover operations | During storage failover operations on Sles12sp3 kernel (kernel-default-4.4.82-6.3.1) with Qlogic QLE2662 HBA, you might observe kernel disruption. This prompts a reboot of the operating system causing application disruption. The kernel disruption generates a vmcore file under /var/crash/ directory if kdump is configured. The vmcore file can be used to understand the cause of the failure. Example: In this case the Kernel disruption was observed in the module "unknown or invalid address" and is logged in vmcore file with the following string - exception RIP: unknown or invalid address. After kernel disruption, the operating system can be recovered by rebooting the host operating system and restarting the application as required. | link:https://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&Display=1117274[1117274^] | Kernel disruption observed on SLES12SP3 with Emulex LPe16002 16GB FC during storage failover operations | During storage failover operations on Sles12sp3 kernel (kernel-default-4.4.87-3.1) with Emulex LPe16002 HBA, you might observe kernel disruption. This prompts a reboot of the operating system causing application disruption. The kernel disruption generates a vmcore file under the /var/crash/ directory if kdump is configured. The vmcore file can be used to understand the cause of the failure. Example: In this case kernel disruption was observed in the module “raw_spin_lock_irqsave+30” and is logged in the vmcore file with the following string: – exception RIP: _raw_spin_lock_irqsave+30. After kernel disruption, the operating system can be recovered by rebooting the host operating system and restarting the application as required. |=== // 2024 SEP 2, ONTAPDOC-2345