Configuring VMware High Availability Network Redundancy

 

 

 

How to Enable Technical Support Mode on an ESXi Host

The following steps will show you how to enable SSH access to your ESXi host.

  1. Go to Home > Inventory > Hosts and Clusters.
  2. Select your ESXi host in the inventory and click the Configuration tab.
  3. In the Software panel, click Security Profile.
  4. Click Properties.
  5. Select Remote Tech Support (SSH) and click Options.
  6. Verify that the Remote Tech Support (SSH) service is started. If it is not, start the service:
    1. Select Start and stop with host.
    2. Click Start. Wait for the service to start and click OK.
    3. Confirm that Remote Tech Support (SSH) is running and click OK.

 

 

 

How to Test Default VMware HA Failure Detection Time

By default, VMware High Availability will initiate corrective action of no heartbeats from a cluster node are received for 15 seconds. This configuration is not ideal for all environments. If the interval is too short, false positives can cause unnecessary downtime. If the interval is too long, service-level agreements might not be met.

The following steps will show you how to adjust the failure detection time.

  1. Open SSH client.
  2. Log in as root to the higher-numbered ESXi host.
  3. Start monitoring log messages in the VMware HA log. Type this command. The command is case-sensitive:

    # tail –f /var/log/vmware/aam/vmware_<higher-numbered_host_name>.log

    For example: # tail –f /var/log/vmware/aam/vmware_sc-plum02.log

  4. Prepare to time how long VMware HA takes to detect a failure.
  5. Simulate a host failure by rebooting the lower-0numbered ESXi host:
    1. Right-click the lower-numbered ESXi host in the inventory and select Reboot.
    2. When a message warns you that your host is not in maintenance mode, click Yes to reboot.
    3. Type Testing VMware HA as the reason for rebooting and click OK.

 

 

 

How to Modify and Test VMware HA Failure Detection Time

The following steps will show you how to increase the failure detection time and test it.

  1. Right-click Lab Cluster in the inventory and select Edit Settings.
  2. Select Cluster Features.
  3. Deselect the Turn On VMware HA check box and click OK.
  4. Wait until the Unconfiguring HA tasks in the Recent Tasks pane complete.
  5. Right-click Lab Cluster in the inventory and select Edit Settings.
  6. Select Cluster Features.
  7. Select the Turn On VMware HA check box.
  8. Select VMware HA and click Advanced Options.
  9. In the Advanced Options (HA) dialog box, type this option-value pair:
  10. Das.failuredetectiontime 30000

  11. Click OK in the Advanced Options (HA) dialog box.
  12. Click OK to commit the changes.
  13. Wait until the Configuring HA tasks in the Recent Tasks pane complete.

 

 

 

 

How to Configure Management Network Redundancy

The following steps will show you how to configure VMware HA network redundancy.

  1. Select your ESXi host in the inventory.
  2. Click the Configuration tab.
  3. Click the Networking link.
  4. Select the virtual switch with the VMware vMotion VMkernel port on it and click Properties.
  5. Select vMotion in the Configuration column of the Ports pane and click Edit.
  6. Select the Management traffic check box and click OK.
  7. Click Close to exit the vSwitch Properties dialog box.

 

 

 

How to Configure VMware HA Host Isolation Response

The following steps will show you how to configure the action taken when an ESXi host that is participating in a VMware HA cluster becomes isolated.

 

 

 

How to Test Network Isolation Functionality

The following steps will show you how to test cluster behavior after network isolation is detected.

  1. Select the higher-numbered ESXi host in the inventory and click Virtual Machines tab.
  2. Migrate all virtual machines you choose to the lower-numbered ESXi host in the cluster.
  3. Note the ESXi host name the virtual machines were migrated to.
  4. If SSH session is closed, open your SSH client and log in as root to the lower-numbered ESXi host.
  5. From the SSH session of the lower-numbered ESXi host, disable the alternate VMware HA heartbeat port, vMotion. Type this command:
    # esxcfg-vmknic –D vMotion
  6. Select Lab Cluster in inventory and click Virtual Machines tab.
  7. Monitor the running virtual machines for the next several minutes. If VMware HA can still communicate between the nodes in the cluster, all virtual machines will remain powered on.
  8. Access the console interface of the lower-numbered ESXi host (for example iLO or DRAC).
  9. Disable the management network interface on the lower-numbered ESXi host:
    1. GO to the console of the lower-numbered ESXi host and press F2.
    2. Enter root password for the ESXi host.
    3. Scroll down the list and select Disable Management Network. Press Enter.
    4. Press F11 to disable the management network. Leave the ESXi console open.
  10. Monitor the log file messages in the SSH session of the higher-numbered ESXi host. And monitor the inventory in the vSphere Client. Wait for 2 minutes.
  11. When the lower-numbered ESXi host stops responding, check the power status of your virtual machines. Are all virtual machines powered on and running on the same ESXi host in step 3? If yes, the task was done correctly. If the virtual machines have been powered off and are being restarted on anther cluster node, verify that you executed the above task correctly.
  12. Reenable the management network interface on the lower-numbered ESXi host:
    1. Return to the console of the lower-numbered ESXi host.
    2. Scroll down and select Enable Management Network. Press Enter.
    3. Press F11 to enable the management network.
  13. From the SSH session on the lower-numbered ESXi host, reenable the secondary interface on the lower-numbered ESXi host. Type the following command:
    # esxcfg-vmknic -e vMotion
  14. Exit SSH session.
  15. Close the ESXi host console.
  16. Press Ctrl+C to stop log monitoring in the SSH sessions and exit the SSH session.
  17. Reenable the VMware vMotion VMkernel port for each ESXi host in the inventory:
    1. Select an ESXi host.
    2. Click the Configuration tab and click the Networking link.
    3. Select the virtual switch with the vMotion VMkernel port on it and click Properties.
    4. Select vMotion in the Configuration column of the Port pane and click OK.
    5. Click Close to exit the vSwitch Properties dialog box.