question

WolfiPu avatar image
6 Votes"
WolfiPu asked ·

There is a problem with Radeon Instinct MI25 MxGPU device. For more information, search for 'graphics device driver error code 43'

I have Windows 10, 1909 installed on a NV4as_v4
After installation of the AMD Driver according to https://docs.microsoft.com/en-us/azure/virtual-machines/windows/n-series-amd-driver-setup
I get the error:

There is a problem with Radeon Instinct MI25 MxGPU device. For more information, search for graphics device driver error code 43

any ideas ?

azure-virtual-machines
· 9
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Can you stop the machine once and start again and try?

0 Votes 0 · ·

Hi, I was stuck with the same error code. What worked for me is ignoring the installer from the official Azure account. I used the Microsoft Windows® 10 (64-bit) for Workstation link.

https://www.amd.com/en/support/kb/release-notes/rn-pro-win-20-q1

Not sure if it was related, but I had Windows 10 Pro instead of Enterprise in the Virtual Machine.

0 Votes 0 · ·

Hi, Only AMD drivers published at https://docs.microsoft.com/en-us/azure/virtual-machines/windows/n-series-amd-driver-setup are supported on NVv4 VMs. If you hit the error code 43 issue on NVv4 VMs then please open a support case and our teams will assist to fix the issue. Please do not install any drivers published on the AMD portal.

0 Votes 0 · ·

Experiencing the same issue here, code 43 using official drivers linked by Microsoft

Several thoughts:

  • This is an especially bad issue if it happens unpredictably; you could have 100 users with Radeon Instinct VDI desktops and some users could have broken drivers, and not know why their graphics software is running poorly. They may get a bad impression of GPU-Accelerated VDI in general if this happens

  • I feel like you should credit back everyone's account that spent time running these instances while this problem is occuring

  • Why do you still offer them for sale if this has been a reported issue for months?


1 Vote 1 · ·

So I put this question out on https://docs.microsoft.com/en-us/azure/virtual-machines/windows/n-series-amd-driver-setup as well, but does anyone know if these drivers even support 1909 and 2004? The article only recommends 1903 as the latest. I'm assuming so since it's just the normal GA drivers, but wanted to verify.

0 Votes 0 · ·

Got my answer. NVv4 are not supported to run on 1909 or 2004.

0 Votes 0 · ·

FYI I notified AMD
They said they found the issue and gave MS the fix, they will be validating it and will deploy it during the next scheduled maintenance cycle

0 Votes 0 · ·

That's fantastic news thank you for sharing the outcome. Did they mention if any action would be required on the administrators behalf such as patch or hotfix, or will this be something they modify directly at the VM Resource level?

0 Votes 0 · ·
vikancha avatar image vikancha KoffskiJoshuaA-3543 ·

We expect to rollout the fix starting next week. The fix is at the Azure platform level and will require customers to stop/start the VM so the VM comes up again on the updated infrastructure.

4 Votes 4 · ·
WolfiPu avatar image
0 Votes"
WolfiPu answered ·

yes ... i tried to turn it off and on again...

· 1 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@WolfiPu

Sorry for the inconvenience you are facing.
Are you still seeing the issue?
If you are still facing the issue, I request you to open a support request to get the issue checked by Azure Support.

0 Votes 0 · ·
MZe-9873 avatar image
0 Votes"
MZe-9873 answered ·

Same here. I got it to work. But after restarting the VM its stopped working again.

· Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

bdimag avatar image
0 Votes"
bdimag answered ·

@jakaruna-MSFT
I get the same following instructions from https://docs.microsoft.com/en-us/azure/virtual-machines/windows/n-series-amd-driver-setup

I have tried

  • Windows 10 EVD - Build 1903 (sku: 19h1-evd)

  • Windows 10 - Build 1809 (sku: rs5-pro-g2)

· Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Nathan-9398 avatar image
0 Votes"
Nathan-9398 answered ·

Same. Once I de-allocate the VM, I lose the ability to use the video device. Any attempt to revive it has been unsuccessful. I've tried to rebuild the VM with the same disk image, no luck. I've tried changing sizes, no luck. When I start a fresh VM and fresh drive, it's inconsistent if the driver and video device come online successfully. I've ended up spending $80 on just trying to get a new instance up and working again, then spend time to re-apply any of my own environment to it.

· Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

mechgt-2155 avatar image
0 Votes"
mechgt-2155 answered ·

I was completely unable to get the Radeon card working with Win10 at all, but was able to get it to work win Win Server 2016 in Azure. However after reboot (or update?) it seems to have the same issue:

Windows has stopped this device because it has reported problems. (Code 43)

· 3 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@mechgt-2155 Apologies for the issue you are facing. We are investigating the code 43 related issue with the GPU and working on a fix. We expect the fix to be ready in the next few days. Please do not install any driver that is not published by Microsoft. If you have not done already then please open a support case so we can notify you when the fix is rolled out.


0 Votes 0 · ·

Since it's been a few days any any update on resolving this issue? And is there anything that should be referenced when opening a support case since this appears to be a known issue?

1 Vote 1 · ·
ChrisTwiest-3509 avatar image
0 Votes"
ChrisTwiest-3509 answered ·

If been also really busy with testing and getting to the root of the error. And I got it fixed now. During our automatic deployment we disable UAC for silent install reasons.
When installing the driver with UAC disabled I got the error Code 43 now I changed it and leave UAC on and I still get the code 43 but then stopping (dealocatting) and starting the VM after that and the driver and GPU works.. Maybe it's just a fluke but it works now..

· 1 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Stop/deallocate and then Start didn't help me. I left UAC at default, and with UAC disabled, same result. Something is not stable with this thing - especially for it to be working and just stop.

0 Votes 0 · ·
TheAMan-4866 avatar image
0 Votes"
TheAMan-4866 answered ·

Have been encountering this issue since nv4asv4 was released! Even went so far as to disable the Microsoft Remote Display Adapter, hoping that doing so would let the AMD driver register before the system chooses the Remote Display Adapter as a substitute; ended up losing my screen upon restart and had a hard time recovering my VM (had to attach the OS disk to a recovery VM and edit the registry to re-enable that)!

I believe it's due to the fact that not every time does the VM get allocated the same GPU that was in use when the driver was first installed, unlike when you literally own the GPU and have it available locally in your rig. Plus since it's 1/8th of the real thing, you can't reserve it either!

You might wanna look at https://community.amd.com/thread/235357.
I wish somebody provides some workaround soon. This is really annoying...

· 1 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Hardware affinity really sounds like it could be on the right track... it also explains the 'luck of the draw' nature that myself and others seem to have experienced. Waiting to hear back, but this is completely unusable in the meantime.

0 Votes 0 · ·
rickman avatar image
0 Votes"
rickman answered ·

Hi there; thanks for the question.

As vikancha commented, we have identified the problem and expect to roll out a fix soon. Once we've rolled out the fix, affected customers should:

  1. Ensure that NVv4 VMs are running the graphics drivers provided by Azure at Install AMD GPU drivers on N-series VMs running Windows, not graphics drivers acquired from other sources.

  2. Stop/start affected VMs so they come up again on the updated infrastructure.

· 7 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

How "soon" can we expect this fix?

0 Votes 0 · ·
vikancha avatar image vikancha DickJansenVerdelICTMedia-3286 ·

We are rolling out the fix in US East today and expect to cover US SouthCentral and Europe West tomorrow. We will provide another update tomorrow.

1 Vote 1 · ·

Can we confirm it was rolled out to US East yesterday? I deployed a new NVv4 VM to US East about an hour ago and had the exact same error after a reboot.

0 Votes 0 · ·
Show more comments
KoffskiJoshuaA-3543 avatar image
0 Votes"
KoffskiJoshuaA-3543 answered ·

So since the rollout of the fix have people had a lot of success with this? I find it's a bit of a crapshoot if I get the error 43 or not. It will survive a few reboots and stops, but eventually I run into the same issue again.

· 1 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Same here... Haven't had even a single "success" yet!

0 Votes 0 · ·
vikancha avatar image
0 Votes"
vikancha answered ·

The rollout of the fix requires us to a perform a coordinated planned maintenance across the entire NVv4 infrastructure. Part of the infrastructure is already updated but to guarantee that the VM is deployed on the updated infrastructure, we have to perform a planned maintenance. We are initialing the process to notify customers by tomorrow about the upcoming maintenance window (the weekend of 6/6-6/7) and the action they need to take. The notification will include instructions to start the maintenance yourself, at a time that works for you.

· 1 · Share
10 |1000 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

@vikancha Please update on this thread if you have any further updates on this issue for benefit of community.

0 Votes 0 · ·