Manage partitioned GPUs

Completed

This unit describes which graphics processing unit (GPU) models are supported on an Azure Stack Hub multinode system. You can also find instructions on installing the drivers used with the GPUs. GPU support in Azure Stack Hub enables solutions such as Artificial Intelligence, training, inference, and data visualization. The AMD Radeon Instinct MI25 can be used to support graphic-intensive applications such as Autodesk AutoCAD.

You can choose from three GPU models in the public preview period. They are available in NVIDIA V100, NVIDIA T4, and AMD MI25 GPUs. These physical GPUs align with the following Azure N-Series virtual machine types as follows:

  • NCv3
  • NVv4 (AMD MI25)
  • NCasT4_v3

Azure Stack Hub GPU support is currently in public preview. This preview version is provided without a service level agreement, and it's not recommended for production workloads. Certain features might not be supported or might have constrained capabilities.

NCv3

NCv3-series virtual machines are powered by NVIDIA Tesla V100 GPUs. Customers can take advantage of these updated GPUs for traditional HPC workloads such as reservoir modeling, DNA sequencing, protein analysis, Monte Carlo simulations, and others.

Size vCPU Memory: GiB Temp storage (SSD) GiB GPU GPU memory: GiB Max data disks Max NICs
Standard_NC6s_v3 6 112 736 1 16 12 4
Standard_NC12s_v3 12 224 1474 2 32 24 8
Standard_NC24s_v3 24 448 2948 4 64 32 8

NVv4

The NVv4-series virtual machines are powered by AMD Radeon Instinct MI25 GPUs. With NVv4-series Azure Stack Hub is introducing virtual machines with partial GPUs. This size can be used for GPU accelerated graphics applications and virtual desktops. NVv4 virtual machines currently support only Windows guest operating system.

Size

vCPU

Memory: GiB

Temp storage (SSD) GiB

GPU

GPU memory: GiB

Max data disks

Max NICs

Standard_NV4as_v4

4

14

88

1/8

2

4

2

NCasT4_v3

Size

vCPU

Memory: GiB

GPU

GPU memory: GiB

Max data disks

Max NICs

Standard_NC4as_T4_v3

4

28

1

16

8

4

Standard_NC8as_T4_v3

8

56

1

16

16

8

Standard_NC16as_T4_v3

16

112

1

16

32

8

Standard_NC64as_T4_v3

64

448

4

64

32

8

Patch and update, FRU behavior of virtual machines

GPU virtual machines will undergo downtime during operations such as patch and update (PnU) and hardware replacement (FRU) of Azure Stack Hub. The following table goes over the state of the virtual machine as observed during these activities and the manual action that the user can do to make these virtual machines available again post these operations.

Operation

PnU - Express Update

PnU - Full Update, OEM update

FRU

Virtual machine state

Unavailable during and post update without manual start operation.

Unavailable during update. Available post update with manual operation

Unavailable during update. Available post update with manual operation

Manual operation

If the virtual machine needs to be made available during the update, if there are available GPU partitions, the virtual machine can be restarted from the portal by clicking the Restart button. Restart the virtual machine after the update from the portal using the Restart button.

Virtual machine cannot be made available during the update. Post update completion, virtual machine needs to be stop-deallocated using the Stop button and started back up using the "Start" button.

Virtual machine cannot be made available during the update.Post update completion, virtual machine needs to be stop-deallocated using the Stop button and started back up using the Start button.