I have a linux guest vm that was reset early 1 morning because of the above HA message after coming out of a VDP backup snapshot but I was wondering if it should have been and what if anything I should do to make sure either my HA settings are appropriate for the cluster or vm or whether I need to make other changes to avoid the issue happening when least appropriate
Below are the events as listed on vSphere for the vm in question. It is very large and there could have been alot of guest disk i/o during the snap to consolidate
The guest doesn't appear to be showing any ill effects I'm just not sure why it was reset
Time | Event Description | Type | My Notes |
---|---|---|---|
01:23:14 am | Task Create virtual machine snapshot | info | Within VDP backup window |
05:31:49 am | Task Remove snapshot | info | |
06:22:37 am | vSphere HA cannot reset this virtual machine | warning | |
06:22:38 am | Alarm vSphere HA virtual machine monitoring error changed from Gray to Red | info | |
06.22:38 am | Alarm vSphere HA virtual machine monitoring error on GUESTVM triggered an action | info | |
06.22:38 am | Alarm vSphere HA virtual machine monitoring error an SNMP trap was sent | info | GUEST VM in DMZ and on different vlan to vCenter or hosts |
06.23:16 am | vSphere HA cannot reset this virtual machine | warning | |
06.23.52 am | Virtual machine disks consolidated successded | info | |
06.23.59 am | Message from ESXiHost: Install the VMware tools package inside this virutal machine | info | vmware tools was already installed and matches host |
06.23.59 am | This virtual machine reset by vSphere HA: VMware Tools heartbeat failure: A screen shot is saved in /datastore/vm/vm-1.png | info | Guest OS was still running in the image, no screen of death |
06.23.59 am | Alarm vSphere HA virtual machine monitoring action changed from Green to Yellow | info |
Given HA & the vm are set up as follows
- HA Cluster Settings:
- cluster default vm restart priority: Medium
- Guest restart priority: High
- Datastore heartbeat: 2 datastores (1 hosting the guest vm the other hosting the vdp appliance)
- VM Settings
- Linux Guest
- vmxnet 3 vNic connected to DMZ vlan
- vm version 7:
There was some disk latency warnings before the backup snapshots were created but no loss of paths to either the guest vm datastore or the backup destination was reported
Am I right in thinking that HA shouldn't have been triggered due to the disk I/O from the consolidation of the snapshot even if it was taking a long time?