Can I use two compute clusters in one training run?

R@J@+ S!Nh@ 1 Reputation point
2021-03-25T15:21:34.54+00:00

I have two compute clusters of V100 GPUs, "Cluster1" with 2 nodes and "Cluster2" with 2 nodes. I want to use both clusters in my training script (PyTorch training).

Right now I can set either "Cluster1" or "Cluster2" as the compute target in my code, and my training script gets 1 V100 GPU. Can anybody help me use both clusters (Cluster1 and Cluster2) as the compute target, so that my training script sees 2 V100 GPUs instead of 1?

Can you explain how to achieve this? Also, please help me understand what "2 nodes" means in a V100 GPU cluster (I know for sure it does not mean two GPUs). What do the 2 nodes in one cluster represent, and how do they work?
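
To make the "1 GPU vs. 2 GPUs" part of the question concrete, here is a minimal sketch of how a training script could report what it has been given. It assumes the job is started by a launcher that sets PyTorch's standard distributed environment variables (`WORLD_SIZE`, `RANK`, `LOCAL_RANK`). The helper name `describe_worker` is hypothetical, and nothing here is specific to Cluster1 or Cluster2. If each node is a separate machine with one V100, each process would still see only its own local GPU, and the combined count would show up as the world size rather than as a local device count.

```python
import os


def describe_worker(env=None):
    """Summarise the distributed-training view of one worker process.

    `env` defaults to the real process environment; passing a dict
    makes the function easy to try out without a launcher.
    """
    env = os.environ if env is None else env
    # Defaults describe a plain single-process run with no launcher.
    world_size = int(env.get("WORLD_SIZE", "1"))  # total worker processes
    rank = int(env.get("RANK", "0"))              # global index of this worker
    local_rank = int(env.get("LOCAL_RANK", "0"))  # index within this node
    return {
        "world_size": world_size,
        "rank": rank,
        "local_rank": local_rank,
        "is_distributed": world_size > 1,
    }


if __name__ == "__main__":
    print(describe_worker())
```

Printing this at the top of the training script on every node would show whether the run is a single process or one of several cooperating workers.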


1 answer

  1. Dave Patrick 426K Reputation points MVP
    2021-03-25T15:23:11.75+00:00

    The Microsoft Certification Program is supported on its own forums. I'd try asking for help with course issues in the dedicated forums here. (Participate\Ask A Question)
    Courses and Course Content/Course Content Issue

    --please don't forget to Accept as answer if the reply is helpful--
