HPC Pack 2019: Unable to submit MPI job on Compute node

zainul abiddin 1 Reputation point
2021-01-13T10:30:12.927+00:00

Hi all,

I am looking for support with Windows HPC Pack 2019. We have an on-premises setup with a head node and a compute node, and both are online.
[Image: 56165-image.png]
On the head node we are able to submit MPI jobs using Job Manager and the command line:

job submit /numnodes:1 mpiexec -c 1 -env OMP_NUM_THREADS 4 C:\Users\Administrator\Desktop\Work_Dir\input\hello_omp.exe

But we are not able to submit MPI jobs that run on the compute node. The error is: "The system cannot find the file specified."

job submit /numnodes:2 mpiexec -c 1 -env OMP_NUM_THREADS 4 C:\Users\Administrator\Desktop\Work_Dir\input\hello_omp.exe

[Image: 56038-image.png]
Please help me with this.
--
Thanks and Regards
Zainul Abiddin

Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.

3 answers

  1. Sumarigo-MSFT 43,561 Reputation points Microsoft Employee
    2021-01-13T16:10:52.987+00:00

    @zainul abiddin Welcome to Microsoft Q&A.

    This issue requires deeper investigation. I would like to engage with you offline for a closer look and to provide quick, specialized assistance. Please send an email with the subject line “Attn:subm” to AzCommunity[at]Microsoft[dot]com, referencing this thread and your Azure subscription ID, and I will follow up with you. Once again, apologies for any inconvenience caused by this issue.

    Thank you for your cooperation on this matter. I look forward to your reply.

  2. Alexander McQuarters 1 Reputation point
    2021-01-15T15:15:18.077+00:00

    I would try using network (UNC) paths instead of local paths for the working, input, and output directories (\servername......). That has worked for me when running jobs on multiple nodes.

  3. Thomas 21 Reputation points
    2021-03-15T19:45:38.787+00:00

    You have to set up network shares so that the job can run from all nodes. We did this on our cluster: every user has their own directory at \\<headnode>\users\<username>. Hope that helps.

    Kind regards
    Thomas
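
A likely reading of the two answers above: the path C:\Users\Administrator\Desktop\... exists only on the head node, so a job that spans two nodes cannot find the executable on the compute node. The following is a minimal Python sketch, not part of the original thread, that rewrites the local path from the question as a UNC path and prints the resulting submit line. The server name `headnode`, the share name `Work_Dir`, and the helper names `to_unc` and `build_submit_command` are all assumptions for illustration. It also assumes the Work_Dir folder has already been shared and is readable from every node.

```python
from pathlib import PureWindowsPath


def to_unc(local_path: str, shared_folder: str, server: str, share: str) -> str:
    """Map a local path inside a shared folder to its UNC equivalent."""
    # Part of the path below the shared folder, e.g. input\hello_omp.exe
    relative = PureWindowsPath(local_path).relative_to(PureWindowsPath(shared_folder))
    # Prefix it with \\server\share
    return str(PureWindowsPath(f"\\\\{server}\\{share}") / relative)


def build_submit_command(exe_unc: str, num_nodes: int, cores: int, omp_threads: int) -> str:
    """Assemble the same 'job submit ... mpiexec ...' line used in the question."""
    return (
        f"job submit /numnodes:{num_nodes} "
        f"mpiexec -c {cores} -env OMP_NUM_THREADS {omp_threads} {exe_unc}"
    )


if __name__ == "__main__":
    exe = to_unc(
        r"C:\Users\Administrator\Desktop\Work_Dir\input\hello_omp.exe",
        r"C:\Users\Administrator\Desktop\Work_Dir",  # folder assumed to be shared
        "headnode",  # hypothetical head node name
        "Work_Dir",  # hypothetical share name
    )
    print(build_submit_command(exe, num_nodes=2, cores=1, omp_threads=4))
```

With these assumed names, the script prints a submit line that points at \\headnode\Work_Dir\input\hello_omp.exe instead of the local C:\ path. A path that contains spaces would additionally need quoting on the command line.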