Docker image building for GPU workload on AKS cluster

Andreas Christofi 96 Reputation points
2021-03-04T10:43:43.637+00:00

Hello, I've had some trouble building an image that can utilize the GPU on the AKS cluster. The instructions shown on this article were followed to install the Nvidia plugin. My understanding is that the image needs to contain the Nvidia drivers, therefore I've used the nvidia/cuda as the base image and tried to install specifically python 3.7.6 on it. However, the final image always ends up with python version 3.6.9.

Also, which cuda version should I install?

I would appreciate any suggestions on how that can be done properly.

Moreover, I'd like to know how to specify the Nvidia library locations in the Kubernetes manifest, as mentioned in this article.

Thanks!

Azure Kubernetes Service (AKS)
Azure Kubernetes Service (AKS)
An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
1,855 questions
{count} votes

Accepted answer
  1. Andreas Christofi 96 Reputation points
    2021-03-05T14:26:19.423+00:00

    Update:

    I've managed to make it work by using this Nvidia base image. Thereafter, installed python dependencies and updated the default python interpreter to be the version that I need. Also, the python-cffi package needs to be installed. Hope this helps anyone trying to do the same!

    0 comments No comments

0 additional answers

Sort by: Most helpful