vMSC Design and Implementation Guidelines

03/12/2025 Contributors

PDFs

This document outlines the design and implementation guidlines for vMSC with ONTAP storage systems.

NetApp Storage Configuration

Setup instructions for NetApp MetroCluster (referred to as an MCC configuration) are available at MetroCluster Documentation. Instructions for SnapMirror active sync (SMas) are also available at SnapMirror Business Continuity overview.

Once you have configured MetroCluster, administering it is like managing a traditional ONTAP environment. You can set up Storage Virtual Machines (SVMs) using various tools like the Command Line Interface (CLI), System Manager, or Ansible. Once the SVMs are configured, create Logical Interfaces (LIFs), volumes, and Logical Unit Numbers (LUNs) on the cluster that will be used for normal operations. These objects will automatically be replicated to the other cluster using the cluster peering network.

If not using MetroCluster, you can use SnapMirror active sync which provides datastore-granular protection and active-active access across multiple ONTAP clusters in different failure domains. SMas uses consistency groups (CGs) to ensure write-order consistency among one or more datastores and you can create multiple CGs depending on your application and datastore requirements. Consistency groups are especially useful for applications that require data synchronization between multiple datastores. SMas also supports Raw Device Mappings (RDMs) and guest-connected storage with in-guest iSCSI initiators. You can learn more about consistency groups at Consistency groups overview.

ONTAP tools now provides a simple way to configure SnapMirror active sync for vMSC. You can use the ONTAP tools vCenter plugin to create and manage SnapMirror active sync relationships between two ONTAP clusters. The plugin provides a simple and intuitive interface to create and manage SnapMirror active sync relationships between two ONTAP clusters. You can learn more about the ONTAP tools vCenter plugin at ONTAP tools for VMware vSphere, or jump right into Protect using host cluster protection.

There is some difference in managing a vMSC configuration with SnapMirror active sync when compared to a MetroCluster. First, SMas is a SAN-only configuration, no NFS datastores can be protected with SnapMirror active sync. Second, you must map both copies of the LUNs to your ESXi hosts for them to access the replicated datastores in both failure domains.

VMware vSphere Configuration

Create a vSphere HA Cluster

Creating a vSphere HA cluster is a multi-step process that is fully documented at How Do You Create and Configure Clusters in the vSphere Client on docs.vmware.com. In short, you must first create an empty cluster, and then, using vCenter, you must add hosts and specify the cluster’s vSphere HA and other settings.

Nothing in this document supersedes VMware vSphere Metro Storage Cluster Recommended Practices. This content is provided for easy reference and is not a substitute for the official VMware documentation.

To configure an HA cluster, complete the following steps:

Connect to the vCenter UI.
In Hosts and Clusters, browse to the data center where you want to create your HA cluster.
Right-click the data center object and select New Cluster. Under basics ensure you have enabled vSphere DRS and vSphere HA. Complete the wizard.

Select the cluster and go to the configure tab. Select vSphere HA and click edit.
Under Host Monitoring, select the Enable Host Monitoring option.

While still on the Failures and Responses tab, Under VM Monitoring, select the VM Monitoring Only option or VM and Application Monitoring option.

Under Admission Control, set the HA admission control option to cluster resource reserve; use 50% CPU/MEM.

Click “OK”.
Select DRS and click EDIT.
Set the automation level to manual unless required by your applications.

Enable VM Component Protection, refer to docs.vmware.com.
The following additional vSphere HA settings are recommended for vMSC with MCC:

Failure	Response
Host failure	Restart VMs
Host isolation	Disabled
Datastore with Permanent Device Loss (PDL)	Power off and restart VMs
Datastore with All paths Down (APD)	Power off and restart VMs
Guest not heartbeating	Reset VMs
VM restart policy	Determined by the importance of the VM
Response for host isolation	Shut down and restart VMs
Response for datastore with PDL	Power off and restart VMs
Response for datastore with APD	Power off and restart VMs (conservative)
Delay for VM failover for APD	3 minutes
Response for APD recovery with APD timeout	Disabled
VM monitoring sensitivity	Preset high

Failure

Response

Host failure

Restart VMs

Host isolation

Disabled

Datastore with Permanent Device Loss (PDL)

Power off and restart VMs

Datastore with All paths Down (APD)

Power off and restart VMs

Guest not heartbeating

Reset VMs

VM restart policy

Determined by the importance of the VM

Response for host isolation

Shut down and restart VMs

Response for datastore with PDL

Power off and restart VMs

Response for datastore with APD

Power off and restart VMs (conservative)

Delay for VM failover for APD

3 minutes

Response for APD recovery with APD timeout

Disabled

VM monitoring sensitivity

Preset high

Configure Datastores for Heartbeating

vSphere HA uses datastores to monitor hosts and virtual machines when the management network has failed. You can configure how vCenter selects heartbeat datastores. To configure datastores for heartbeating, complete the following steps:

In the Datastore Heartbeating section, select Use Datastores from the Specified List and Compliment Automatically if Needed.
Select the datastores you want vCenter to use from both sites and press OK.

A screenshot of a computer Description automatically generated

Configure Advanced Options

Isolation events occur when hosts within an HA cluster lose connectivity to either the network or other hosts in the cluster. By default, vSphere HA will use the default gateway for its management network as the default isolation address. However, you can specify additional isolation addresses for the host to ping to determine whether an isolation response should be triggered. Add two isolation IPs that can ping, one per site. Do not use the gateway IP. The vSphere HA advanced setting used is das.isolationaddress. You can use ONTAP or Mediator IP addresses for this purpose.

Refer to VMware vSphere Metro Storage Cluster Recommended Practices for more information.

Adding an advanced setting called das.heartbeatDsPerHost can increase the number of heartbeat datastores. Use four heartbeat datastores (HB DSs)—two per site. Use the “Select from List but Compliment” option. This is needed because if one site fails, you still need two HB DSs. However, those don’t have to be protected with MCC or SnapMirror active sync.

Refer to VMware vSphere Metro Storage Cluster Recommended Practices for more information.

VMware DRS Affinity for NetApp MetroCluster

In this section, we create DRS groups for VMs and hosts for each site\cluster in the MetroCluster environment. Then we configure VM\Host rules to align VM host affinity with local storage resources. For example, site A VMs belong to VM group sitea_vms and site A hosts belong to host group sitea_hosts. Next, in VM\Host Rules, we state that sitea_vms should run on hosts in sitea_hosts.

NetApp highly recommends the specification Should Run on Hosts in Group rather than the specification Must Run on Hosts in Group. In the event of a site A host failure, the VMs of site A need to be restarted on hosts at site B through vSphere HA, but the latter specification does not allow HA to restart VMs on site B because it’s a hard rule. The former specification is a soft rule and will be violated in the event of HA, thus enabling availability rather than performance.
You can create an event-based alarm that is triggered when a virtual machine violates a VM-Host affinity rule. In the vSphere Client, add a new alarm for the virtual machine and select “VM is violating VM-Host Affinity Rule” as the event trigger. For more information about creating and editing alarms, refer to vSphere Monitoring and Performance documentation.

Create DRS Host Groups

To create DRS host groups specific to site A and site B, complete the following steps:

In the vSphere web client, right-click the cluster in the inventory and select Settings.
Click VM\Host Groups.
Click Add.
Type the name for the group (for instance, sitea_hosts).
From the Type menu, select Host Group.
Click Add and select the desired hosts from site A and click OK.
Repeat these steps to add another host group for site B.
Click OK.

Create DRS VM Groups

To create DRS VM groups specific to site A and site B, complete the following steps:

In the vSphere web client, right-click the cluster in the inventory and select Settings.

Click VM\Host Groups.
Click Add.
Type the name for the group (for instance, sitea_vms).
From the Type menu, select VM Group.
Click Add and select the desired VMs from site A and click OK.
Repeat these steps to add another host group for site B.
Click OK.

Create VM Host Rules

To create DRS affinity rules specific to site A and site B, complete the following steps:

In the vSphere web client, right-click the cluster in the inventory and select Settings.

Click VM\Host Rules.
Click Add.
Type the name for the rule (for instance, sitea_affinity).
Verify the Enable Rule option is checked.
From the Type menu, select Virtual Machines to Hosts.
Select the VM group (for instance, sitea_vms).
Select the Host group (for instance, sitea_hosts).
Repeat these steps to add another VM\Host Rule for site B.
Click OK.

Create datastore clusters if needed

To configure a datastore cluster for each site, complete the following steps:

Using the vSphere web client, browse to the data center where the HA cluster resides under Storage.
Right-click the data center object and select Storage > New Datastore Cluster.

*When using ONTAP storage, it is recommended to disable Storage DRS.

Storage DRS is generally not needed or recommended for use with ONTAP storage systems.
ONTAP provides its own storage efficiency features, such as deduplication, compression, and compaction, which can be impacted by Storage DRS.
If you are using ONTAP snapshots, then storage vMotion would leave behind the copy of the VM in the snapshot, potentially increasing storage utilization and may impact backup applications like NetApp SnapCenter which track VMs and their ONTAP snapshots.

Select the HA cluster and click Next.

Select the datastores belonging to site A and click Next.

Review options and click Finish.
Repeat these steps to create the site B datastore cluster and verify that only datastores of site B are selected.

vCenter Server Availability

Your vCenter Server Appliances (VCSAs) should be protected with vCenter HA. vCenter HA allows you to deploy two VCSAs in an active-passive HA pair. One in each failure domain. You can read more about vCenter HA on docs.vmware.com.

vMSC Design and Implementation Guidelines

Creating your file...

NetApp Storage Configuration

VMware vSphere Configuration

Create a vSphere HA Cluster

Configure Datastores for Heartbeating

Configure Advanced Options

Create DRS Host Groups

Create DRS VM Groups

Create VM Host Rules

Create datastore clusters if needed

vCenter Server Availability