Open MPI - spawn processes
Hi guys, I am new in OpenMPI so I will try to be as clear as possible here.
I instaled Yade 2021.01a on my cluster singularity/
I can run simulations using all cores I want. My cluster has 160 nodes, each node 80 cpu's.
So far so good.
I am now trying to run multiple nodes. For this, I am checking out this example [1].
When I run it I am getting the following message:
+ singularity run /beegfs/
/usr/lib/
boot.
TCP python prompt on localhost:9000, auth cookie `ksaeuc'
Welcome to Yade 2021.01a
Using python version: 3.9.7 (default, Sep 24 2021, 09:43:00)
[GCC 10.3.0]
Warning: no X rendering available (see https:/
XMLRPC info provider on http://
Running script Parallel.py
Traceback (most recent call last):
File "/usr/bin/yade", line 343, in runScript
execfile(
File "/usr/lib/
exec_(code, myglobals, mylocals)
File "Parallel.py", line 28, in <module>
mp.
File "/usr/lib/
comm_slave = MPI.COMM_
File "mpi4py/
mpi4py.
[95mMaster: will spawn 9 workers running: /usr/bin/yade ['-j5', 'Parallel.py'] [0m
[[ ^L clears screen, ^U kills line. [1mF8[0m plot. ]]
In [1]: Do you really want to exit ([y]/n)?
I am not sure from where it is comming. Any idea?
This is how I am running it in my Batch:
#!/bin/bash -x
#SBATCH --nodes=2
#SBATCH --ntasks=2
#SBATCH --cpus-per-task=80
#SBATCH --partition=compute
#SBATCH --job-name=
#SBATCH --time=10:00:00
singularity run /beegfs/
PS. I am supposing that numMPIThreads = 10 in the python script is equal to nodes*-j (2*5 in this case).
[1]https:/
Question information
- Language:
- English Edit question
- Status:
- Solved
- For:
- Yade Edit question
- Assignee:
- No assignee Edit question
- Solved by:
- Luis Barbosa
- Solved:
- Last query:
- Last reply: