VMware vSphere Metro Storage Cluster with SnapMirror active sync
VMware vSphere Metro Storage Cluster (vMSC) is a stretched cluster solution across different fault domains to provide
* Workload mobility across availability zones or sites.
* downtime avoidance
* disaster avoidance
* fast recovery
This document provides the vMSC implementation details with SnapMirror active sync (SM-as) utilizing System Manager and ONTAP Tools. Further, it shows how the VM can be protected by replicating to third site and manage with SnapCenter Plugin for VMware vSphere.
SnapMirror active sync supports ASA, AFF and FAS storage arrays. It is recommended to use same type (Performance/Capacity models) on both fault domains. Currently, only block protocols like FC and iSCSI are supported. For further support guidelines, refer Interoperability Matrix Tool and Hardware Universe
vMSC supports two different deployment models named Uniform host access and Non-uniform host access. In Uniform host access configuration, every host on the cluster has access to LUN on both fault domains. It is typically used in different availability zones in same datacenter.
In Non-Uniform host access configuration, host has access only to local fault domain. It is typically used in different sites where running multiple cables across the fault domains are restrictive option.
In Non-Uniform host access mode, the VMs will be restarted in other fault domain by vSphere HA. Application availability will be impacted based on its design. Non-Uniform host access mode is supported only with ONTAP 9.15 onwards. |
Prerequisites
If using self-signed certificate, the CA certificate can be retrieved from the <installation path>/ontap_mediator/server_config/ca.crt on mediator VM. |
vMSC non-uniform host access with ONTAP System Manager UI.
Note: ONTAP Tools 10.2 or above can be used to provision stretched datastore with non-uniform host access mode without switching multiple user interfaces. This section is just for reference if ONTAP Tools is not used.
-
Note down one of the iSCSI data lif IP address from the local fault domain storage array.
-
On vSphere host iSCSI Storage Adapter, add that iSCSI IP under the Dynamic Discovery tab.
For Uniform access mode, need to provide the source and target fault domain iSCSI data lif address. -
Repeat the above step on vSphere hosts for the other fault domain adding its local iSCSI data lif IP on Dynamic Discovery tab.
-
With proper network connectivity, four iSCSI connection should exist per vSphere host that has two iSCSI VMKernel nics and two iSCSI data lifs per storage controller.
-
Create LUN using ONTAP System Manager, setup SnapMirror with replication policy AutomatedFailOverDuplex, pick the host initiators and set host proximity.
-
On other fault domain storage array, create the SAN initiator group with its vSphere host initiators and set host proximity.
For Uniform access mode, the igroup can be replicated from source fault domain. -
Map the replicated LUN with same mapping ID as in source fault domain.
-
On vCenter, right click on vSphere Cluster and select Rescan Storage option.
-
On one of the vSphere host in the cluster, check the newly created device shows up with datastore showing Not Consumed.
-
On vCenter, right click on vSphere Cluster and select New Datastore option.
-
On Wizard, remember to provide the datastore name and select the device with right capacity & device id.
-
Verify the datastore is mounted on all hosts on cluster across both fault domains.
The above screenshots shows Active I/O on single controller since we used AFF. For ASA, it will have Active IO on all paths. -
When additional datastores are added, need to remember to expand the existing Consistency Group to have it consistent across the vSphere cluster.
vMSC uniform host access mode with ONTAP Tools.
-
Ensure NetApp ONTAP Tools is deployed and registered to vCenter.
If not, follow ONTAP Tools deployment and Add a vCenter server instance -
Ensure ONTAP Storage systems are registered to ONTAP Tools. This includes both fault domain storage systems and third one for Asynchronous remote replication to use for VM protection with SnapCenter Plugin for VMware vSphere.
If not, follow Add storage backend using vSphere client UI -
Update hosts data to sync with ONTAP Tools and then, create a datastore.
-
To enable SM-as, right click on vSphere cluster and pick Protect cluster on NetApp ONTAP Tools (refer above screenshot)
-
It will show existing datastores for that cluster along with SVM details. The default CG name is <vSphere Cluster name>_<SVM name>. Click on Add Relationship button.
-
Pick the target SVM and set the policy to AutomatedFailOverDuplex for SM-as. There is a toggle switch for Uniform host configuration. Set the proximity for each host.
-
Verify the host promity info and other details. Add another relationship to third site with replication policy of Asynchronous if required. Then, click on Protect.
NOTE: If plan to use SnapCenter Plug-in for VMware vSphere 6.0, the replication needs to be setup at volume level rather than at Consistency Group level. -
With Uniform host access, the host has iSCSI connection to both fault domain storage arrays.
NOTE: The above screenshot is from AFF. If ASA, ACTIVE I/O should be in all paths with proper network connections. -
ONTAP Tools plugin also indicates the volume is protected or not.
-
For more details and to update the host proximity info, Host cluster relationships option under the ONTAP Tools can be utilized.
VM protection with SnapCenter plug-in for VMware vSphere.
SnapCenter Plug-in for VMware vSphere (SCV) 6.0 or above supports SnapMirror active sync and also in combination with SnapMirror Async to replicate to third fault domain.
Supported use-cases include:
* Backup and Restore the VM or Datastore from either of fault domains with SnapMirror active sync.
* Restore resources from third fault domain.
-
Add all the ONTAP Storage Systems planned to use in SCV.
-
Create Policy. Ensure Update SnapMirror after backup is checked for SM-as and also Update SnapVault after backup for Async replication to third fault domain.
-
Create Resource Group with desiered items that need to be protected, associate to policy and schedule.
NOTE: Snapshot name ending with _recent is not supported with SM-as. -
Backups occur at scheduled time based on Policy associated to Resource Group. Jobs can be monitored from the Dashboard job monitor or from the backup info on those resources.
-
VMs can be restored to same or alternate vCenter from the SVM on Primary fault domain or from one of the secondary locations.
-
Similar option is also available for Datastore mount operation.
For assistance with additional operations with SCV, refer SnapCenter Plug-in for VMware vSphere documentation