-
Notifications
You must be signed in to change notification settings - Fork 25
Open
Description
I've hit this issue when trying to start EESSI from a container (note: this is not the EESSI container, it's just docker:rockylinux9 - I'm experimenting with mounting a squashfs filesystem providing EESSI into this container), but I expect it would happen on any system that doesn't have ssh or rsh.
mpirun -np 2 gmx_mpi --version
--------------------------------------------------------------------------
The value of the MCA parameter "plm_rsh_agent" was set to a path
that could not be found:
plm_rsh_agent: ssh : rsh
Please either unset the parameter, or check that the path is correct
--------------------------------------------------------------------------
[tcn43.local.snellius.surf.nl:99269] [[INVALID],INVALID] FORCE-TERMINATE AT Not found:-13 - error plm_rsh_component.c(335)
Now, I'm not saying we should have ssh and rsh in the compat layer, but... at this point, we have software in the software layer (namely: OpenMPI) which relies on OS dependencies (namely ssh and rsh) that are NOT present in the compatibility layer. That means that, on a system with very minimal OS, a natively installed EESSI would not be able to launch MPI tasks.
Metadata
Metadata
Assignees
Labels
No labels