DSVM does not boot

Tomasz Grobelny 0 Reputation points
2023-06-03T23:26:29.1666667+00:00

Hi all,

I tried to create an Ubuntu-based DSVM on a Standard NV6ads A10 v5, but the deployment does not complete because the machine does not fully boot. On the console I see it enter an infinite loop while loading kernel modules.

So the question is: is the Ubuntu DSVM supposed to work on NV6ads A10 v5, or does DSVM have requirements regarding VM size?

Additional info: I tried setting up a plain Ubuntu VM (20.04 or 22.04), and it works on the same VM size. But then following https://learn.microsoft.com/en-us/azure/virtual-machines/linux/n-series-driver-setup to install the NVIDIA CUDA drivers results in the same infinite loop and an inability to connect via SSH.

UPDATE: I finally managed to boot; it might be that Secure Boot was needed. Something is still wrong, though:

$ sudo /usr/bin/nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

Azure Data Science Virtual Machines
Azure Virtual Machine images that are pre-installed, configured, and tested with several commonly used tools for data analytics, machine learning, and artificial intelligence training.

1 answer

  1. Sedat SALMAN 13,160 Reputation points
    2023-06-04T11:05:58.3333333+00:00

    According to the following GitHub discussion,

    https://github.com/ant-media/Ant-Media-Server/issues/4547

    the CUDA driver appears to have known issues on the NV6ads A10 v5 VM size with Ubuntu 20.04 and may not function properly there. The discussion suggests installing the CUDA driver manually, following the instructions at this link:

    https://resources.antmedia.io/docs/using-nvidia-gpus
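
Before reinstalling anything, it may help to collect a few facts about why `nvidia-smi` cannot reach the driver. The sketch below is only a starting point, not a verified fix for this VM size. It assumes the usual Ubuntu tools (`mokutil`, `lspci`, `lsmod`, `modinfo`, `dmesg`) are present and skips any that are missing. `run_check` is a hypothetical helper introduced for this example, and the search patterns are guesses at what is relevant. Every command is read-only.

```shell
#!/bin/sh
# Diagnostic sketch: gather hints about why nvidia-smi cannot talk to the driver.
# Nothing here modifies the system.

# run_check TITLE PATTERN COMMAND [ARGS...]
# Hypothetical helper: runs COMMAND only if it is installed and prints the
# output lines that match PATTERN (case-insensitive extended regex).
run_check() {
    title=$1
    pattern=$2
    shift 2
    echo "== ${title} =="
    if ! command -v "$1" >/dev/null 2>&1; then
        echo "($1 is not installed, skipping)"
    else
        "$@" 2>&1 | grep -i -E -- "$pattern" || echo "(no lines matching '${pattern}')"
    fi
}

# With Secure Boot enabled, the kernel refuses modules that are not signed
# with a trusted key, which is one possible reason a driver never loads.
run_check "Secure Boot state" "secureboot" mokutil --sb-state

# Is the GPU visible on the PCI bus at all?
run_check "NVIDIA PCI devices" "nvidia" lspci

# Is an nvidia kernel module currently loaded?
run_check "Loaded nvidia modules" "nvidia" lsmod

# Is there an nvidia module installed for the running kernel?
run_check "nvidia module version for kernel $(uname -r)" "." modinfo -F version nvidia

# Kernel messages from the driver; this may need sudo on some systems.
run_check "Kernel log" "nvidia|nvrm" dmesg
```

If the module exists but is not loaded, the kernel log section is usually the most informative part of the output, since it tends to show why loading was refused.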
