Troubleshoot Azure virtual machine backup
You can troubleshoot errors encountered while using Azure Backup with the information listed below:
This section covers backup operation failure of Azure Virtual machine.
CopyingVHDsFromBackUpVaultTakingLongTime - Copying backed up data from vault timed out
Error code: CopyingVHDsFromBackUpVaultTakingLongTime
Error message: Copying backed up data from vault timed out
This could happen due to transient storage errors or insufficient storage account IOPS for backup service to transfer data to the vault within the timeout period. Configure VM backup using these best practices and retry the backup operation.
UserErrorVmNotInDesirableState - VM is not in a state that allows backups.
Error code: UserErrorVmNotInDesirableState
Error message: VM is not in a state that allows backups.
The backup operation failed because the VM is in Failed state. For successful backup the VM state should be Running, Stopped, or Stopped (deallocated).
- If the VM is in a transient state between Running and Shut down, wait for the state to change. Then trigger the backup job.
- If the VM is a Linux VM and uses the Security-Enhanced Linux kernel module, exclude the Azure Linux Agent path /var/lib/waagent from the security policy and make sure the Backup extension is installed.
UserErrorFsFreezeFailed - Failed to freeze one or more mount-points of the VM to take a file-system consistent snapshot
Error code: UserErrorFsFreezeFailed
Error message: Failed to freeze one or more mount-points of the VM to take a file-system consistent snapshot.
- Check the file system state of all mounted devices using the tune2fs command, for example tune2fs -l /dev/sdb1 \.| grep Filesystem state.
- Unmount the devices for which the file system state was not cleaned, using the umount command.
- Run a file system consistency check on these devices by using the fsck command.
- Mount the devices again and retry backup operation.
ExtensionSnapshotFailedCOM / ExtensionInstallationFailedCOM / ExtensionInstallationFailedMDTC - Extension installation/operation failed due to a COM+ error
Error code: ExtensionSnapshotFailedCOM
Error message: Snapshot operation failed due to COM+ error
Error code: ExtensionInstallationFailedCOM
Error message: Extension installation/operation failed due to a COM+ error
Error code: ExtensionInstallationFailedMDTC
Error message: Extension installation failed with the error "COM+ was unable to talk to the Microsoft Distributed Transaction Coordinator
The Backup operation failed due to an issue with Windows service COM+ System application. To resolve this issue, follow these steps:
- Try starting/restarting Windows service COM+ System Application (from an elevated command prompt - net start COMSysApp).
- Ensure Distributed Transaction Coordinator services is running as Network Service account. If not, change it to run as Network Service account and restart COM+ System Application.
- If unable to restart the service, then reinstall Distributed Transaction Coordinator service by following the below steps:
- Stop the MSDTC service
- Open a command prompt (cmd)
- Run command “msdtc -uninstall”
- un command “msdtc -install”
- Start the MSDTC service
- Start the Windows service COM+ System Application. After the COM+ System Application starts, trigger a backup job from the Azure portal.
ExtensionFailedVssWriterInBadState - Snapshot operation failed because VSS writers were in a bad state
Error code: ExtensionFailedVssWriterInBadState
Error message: Snapshot operation failed because VSS writers were in a bad state.
Restart VSS writers that are in a bad state. From an elevated command prompt, run
vssadmin list writers. The output contains all VSS writers and their state. For every VSS writer with a state that's not  Stable, to restart VSS writer, run the following commands from an elevated command prompt:
net stop serviceName
net start serviceName
ExtensionConfigParsingFailure - Failure in parsing the config for the backup extension
Error code: ExtensionConfigParsingFailure
Error message: Failure in parsing the config for the backup extension.
This error happens because of changed permissions on the MachineKeys directory: %systemdrive%\programdata\microsoft\crypto\rsa\machinekeys. Run the following command and verify that permissions on the MachineKeys directory are default ones:icacls %systemdrive%\programdata\microsoft\crypto\rsa\machinekeys.
Default permissions are as follows:
- Everyone: (R,W)
- BUILTIN\Administrators: (F)
If you see permissions in the MachineKeys directory that are different than the defaults, follow these steps to correct permissions, delete the certificate, and trigger the backup:
Fix permissions on the MachineKeys directory. By using Explorer security properties and advanced security settings in the directory, reset permissions back to the default values. Remove all user objects except the defaults from the directory and make sure the Everyone permission has special access as follows:
- List folder/read data
- Read attributes
- Read extended attributes
- Create files/write data
- Create folders/append data
- Write attributes
- Write extended attributes
- Read permissions
Delete all certificates where Issued To is the classic deployment model or Windows Azure CRP Certificate Generator:
- Open certificates on a local computer console.
- Under Personal > Certificates, delete all certificates where Issued To is the classic deployment model or Windows Azure CRP Certificate Generator.
Trigger a VM backup job.
ExtensionStuckInDeletionState - Extension state is not supportive to backup operation
Error code: ExtensionStuckInDeletionState
Error message: Extension state is not supportive to backup operation
The Backup operation failed due to inconsistent state of Backup Extension. To resolve this issue, follow these steps:
- Ensure Guest Agent is installed and responsive
- From Azure portal go to Virtual Machine > All Settings > Extensions
- Select the backup extension VmSnapshot or VmSnapshotLinux and click Uninstall
- After deleting backup extension retry the backup operation
- The subsequent backup operation will install the new extension in the desired state
ExtensionFailedSnapshotLimitReachedError - Snapshot operation failed as snapshot limit is exceeded for some of the disks attached
Error code: ExtensionFailedSnapshotLimitReachedError
Error message: Snapshot operation failed as snapshot limit is exceeded for some of the disks attached
The snapshot operation failed as the snapshot limit has exceeded for some of the disks attached. Complete the below troubleshooting steps and then retry the operation.
Delete the disk blob-snapshots which are not required. Be cautious to not delete Disk blob, only snapshot blobs should be deleted.
If Soft-delete is enabled on VM disk Storage-Accounts, configure soft-delete retention such that existing snapshots are less than the maximum allowed at any point of time.
If Azure Site Recovery is enabled in the backed up VM, then perform the below:
- Ensure the value of isanysnapshotfailed is set as false in /etc/azure/vmbackup.conf
- Schedule Azure Site Recovery at a different time, such that it does not conflict the backup operation.
ExtensionFailedTimeoutVMNetworkUnresponsive - Snapshot operation failed due to inadequate VM resources.
Error code: ExtensionFailedTimeoutVMNetworkUnresponsive
Error message: Snapshot operation failed due to inadequate VM resources.
Backup operation on the VM failed due to delay in network calls while performing the snapshot operation. To resolve this issue, perform Step 1. If the issue persists, try steps 2 and 3.
Step 1: Create snapshot through Host
From an elevated (admin) command-prompt, run the below command:
REG ADD "HKLM\SOFTWARE\Microsoft\BcdrAgentPersistentKeys" /v SnapshotMethod /t REG_SZ /d firstHostThenGuest /f REG ADD "HKLM\SOFTWARE\Microsoft\BcdrAgentPersistentKeys" /v CalculateSnapshotTimeFromHost /t REG_SZ /d True /f
This will ensure the snapshots are taken through host instead of Guest. Retry the backup operation.
Step 2: Try changing the backup schedule to a time when the VM is under less load (less CPU/IOps etc.)
Step 3: Try increasing the size of VM and retry the operation
Common VM backup errors
|Error code: 320001
Error message: Could not perform the operation as VM no longer exists.
Error code: 400094
Error message: The virtual machine doesn't exist
An Azure virtual machine wasn't found.
|This error happens when the primary VM is deleted, but the backup policy still looks for a VM to back up. To fix this error, take the following steps:
|The VM is in failed provisioning state:
Restart the VM and make sure the VM is running or shut down.
|This error occurs when one of the extension failures puts the VM into failed provisioning state. Go to the extensions list, check if there's a failed extension, remove it, and try restarting the virtual machine. If all extensions are in running state, check if the VM Agent service is running. If not, restart the VM Agent service.|
|Error code: UserErrorBCMPremiumStorageQuotaError
Error message: Could not copy the snapshot of the virtual machine, due to insufficient free space in the storage account
|For premium VMs on VM backup stack V1, we copy the snapshot to the storage account. This step makes sure that backup management traffic, which works on the snapshot, doesn't limit the number of IOPS available to the application using premium disks.
We recommend that you allocate only 50 percent, 17.5 TB, of the total storage account space. Then the Azure Backup service can copy the snapshot to the storage account and transfer data from this copied location in the storage account to the vault.
|Failed to install Microsoft Recovery Services extension as virtual machine is not running
The VM Agent is a prerequisite for the Azure Recovery Services extension. Install the Azure Virtual Machine Agent and restart the registration operation.
|The snapshot operation failed with the Volume Shadow Copy Service (VSS) operation error This drive is locked by BitLocker Drive Encryption. You must unlock this drive from the Control Panel.||Turn off BitLocker for all drives on the VM and check if the VSS issue is resolved.|
|The VM isn't in a state that allows backups.||
|The VM Agent isn't present on the virtual machine:
Install any prerequisite and the VM Agent. Then restart the operation.
|Read more about VM Agent installation and how to validate VM Agent installation.|
|Backup failed to freeze one or more mount points of the VM to take a file system consistent snapshot.||Take the following step:
|The snapshot operation failed because of failure to create a secure network communication channel.||
|The snapshot operation failed because of failure to install Visual C++ Redistributable for Visual Studio 2012.||Navigate to C:\Packages\Plugins\Microsoft.Azure.RecoveryServices.VMSnapshot\agentVersion and install vcredist2012_x64.
Make sure that the registry key value that allows the service installation is set to the correct value. That is, set the Start value in HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\Msiserver to 3 and not 4.
If you still have issues with installation, restart the installation service by running MSIEXEC /UNREGISTER followed by MSIEXEC /REGISTER from an elevated command prompt.
|Cancellation isn't supported for this job type:
Wait until the job finishes.
|The job isn't in a cancelable state:
Wait until the job finishes.
The selected job isn't in a cancelable state:
Wait for the job to finish.
|It's likely that the job is almost finished. Wait until the job is finished.|
|Backup can't cancel the job because it isn't in progress:
Cancellation is supported only for jobs in progress. Try to cancel an in-progress job.
|This error happens because of a transitory state. Wait a minute and retry the cancel operation.|
|Backup failed to cancel the job:
Wait until the job finishes.
|Restore failed with a cloud internal error.||
|The selected DNS name is already taken:
Specify a different DNS name and try again.
|This DNS name refers to the cloud service name, usually ending with .cloudapp.net. This name needs to be unique. If you get this error, you need to choose a different VM name during restore.
This error is shown only to users of the Azure portal. The restore operation through PowerShell succeeds because it restores only the disks and doesn't create the VM. The error will be faced when the VM is explicitly created by you after the disk restore operation.
|The specified virtual network configuration isn't correct:
Specify a different virtual network configuration and try again.
|The specified cloud service is using a reserved IP that doesn't match the configuration of the virtual machine being restored:
Specify a different cloud service that isn't using a reserved IP. Or choose another recovery point to restore from.
|The cloud service has reached its limit on the number of input endpoints:
Retry the operation by specifying a different cloud service or by using an existing endpoint.
|The Recovery Services vault and target storage account are in two different regions:
Make sure the storage account specified in the restore operation is in the same Azure region as your Recovery Services vault.
|The storage account specified for the restore operation isn't supported:
Only Basic or Standard storage accounts with locally redundant or geo-redundant replication settings are supported. Select a supported storage account.
|The type of storage account specified for the restore operation isn't online:
Make sure that the storage account specified in the restore operation is online.
|This error might happen because of a transient error in Azure Storage or because of an outage. Choose another storage account.|
|The resource group quota has been reached:
Delete some resource groups from the Azure portal or contact Azure Support to increase the limits.
|The selected subnet doesn't exist:
Select a subnet that exists.
|The Backup service doesn't have authorization to access resources in your subscription.||To resolve this error, first restore disks by using the steps in Restore backed-up disks. Then use the PowerShell steps in Create a VM from restored disks.|
Backup or restore takes time
Set up the VM Agent
Typically, the VM Agent is already present in VMs that are created from the Azure gallery. But virtual machines that are migrated from on-premises datacenters won't have the VM Agent installed. For those VMs, the VM Agent needs to be installed explicitly.
- Download and install the agent MSI. You need Administrator privileges to finish the installation.
- For virtual machines created by using the classic deployment model, update the VM property to indicate that the agent is installed. This step isn't required for Azure Resource Manager virtual machines.
- Install the latest version of the agent from the distribution repository. For details on the package name, see the Linux Agent repository.
- For VMs created by using the classic deployment model, use this blog to update the VM property and verify that the agent is installed. This step isn't required for Resource Manager virtual machines.
Update the VM Agent
- To update the VM Agent, reinstall the VM Agent binaries. Before you update the agent, make sure no backup operations occur during the VM Agent update.
To update the Linux VM Agent, follow the instructions in the article Updating the Linux VM Agent.
Always use the distribution repository to update the agent.
Don't download the agent code from GitHub. If the latest agent isn't available for your distribution, contact the distribution support for instructions to acquire the latest agent. You can also check the latest Windows Azure Linux agent information in the GitHub repository.
Validate VM Agent installation
Verify the VM Agent version on Windows VMs:
- Sign in to the Azure virtual machine and navigate to the folder C:\WindowsAzure\Packages. You should find the WaAppAgent.exe file.
- Right-click the file and go to Properties. Then select the Details tab. The Product Version field should be 2.6.1198.718 or higher.
Troubleshoot VM snapshot issues
VM backup relies on issuing snapshot commands to underlying storage. Not having access to storage or delays in a snapshot task run can cause the backup job to fail. The following conditions can cause snapshot task failure:
Network access to Storage is blocked by using NSG. Learn more on how to establish network access to Storage by using either allowed list of IPs or through a proxy server.
VMs with SQL Server backup configured can cause snapshot task delay. By default, VM backup creates a VSS full backup on Windows VMs. VMs that run SQL Server, with SQL Server backup configured, can experience snapshot delays. If snapshot delays cause backup failures, set following registry key:
VM status is reported incorrectly because the VM is shut down in RDP. If you used the remote desktop to shut down the virtual machine, verify that the VM status in the portal is correct. If the status isn't correct, use the Shutdown option in the portal VM dashboard to shut down the VM.
If more than four VMs share the same cloud service, spread the VMs across multiple backup policies. Stagger the backup times, so no more than four VM backups start at the same time. Try to separate the start times in the policies by at least an hour.
The VM runs at high CPU or memory. If the virtual machine runs at high memory or CPU usage, more than 90 percent, your snapshot task is queued and delayed. Eventually it times out. If this issue happens, try an on-demand backup.
Like all extensions, Backup extensions need access to the public internet to work. Not having access to the public internet can manifest itself in various ways:
- Extension installation can fail.
- Backup operations like disk snapshot can fail.
- Displaying the status of the backup operation can fail.
The need to resolve public internet addresses is discussed in this Azure Support blog. Check the DNS configurations for the VNET and make sure the Azure URIs can be resolved.
After name resolution is done correctly, access to the Azure IPs also needs to be provided. To unblock access to the Azure infrastructure, follow one of these steps:
- Allow list of Azure datacenter IP ranges:
- Create a path for HTTP traffic to flow:
- If you have some network restriction in place, deploy an HTTP proxy server to route the traffic. An example is a network security group. See the steps to deploy an HTTP proxy server in Establish network connectivity.
- Add rules to the NSG, if you have one in place, to allow access to the internet from the HTTP proxy.
DHCP must be enabled inside the guest for IaaS VM backup to work. If you need a static private IP, configure it through the Azure portal or PowerShell. Make sure the DHCP option inside the VM is enabled. Get more information on how to set up a static IP through PowerShell: