[openib-general] Problems running MPI jobs with MVAPICH

Don.Albert at Bull.com Don.Albert at Bull.com
Wed Mar 29 06:23:36 PST 2006


Weikuan,

> Could you try to run MVAPICH (1 or 2) with tcp option first, or use
> mpirun_rsh initially? Just to be sure you can run any MPI to start with.
> Please let us know of the problems too, along with the same set of 
traces
> and machine specification.
> 
With no changes to the software, I tried "mpirun_rsh".  If I use a simple 
command like "uname" it works:

[koa] (ib) ib> mpirun_rsh -np 2 koa jatoba uname -a
Linux jatoba.az05.bull.com 2.6.9-34.ELsmp #1 SMP Fri Feb 24 16:56:28 EST 
2006 x86_64 x86_64 x86_64 GNU/Linux
Linux koa.az05.bull.com 2.6.9-34.ELsmp #1 SMP Fri Feb 24 16:56:28 EST 2006 
x86_64 x86_64 x86_64 GNU/Linux

but if I try to run the MPI job, it fails with a segmentation fault on 
both sides:

[koa] (ib) ib> mpirun_rsh -np 2 koa jatoba /home/ib/mpi/tests/cpi/cpi
bash: line 1: 16302 Segmentation fault      /usr/bin/env MPIRUN_MPD=0 
MPIRUN_HOST=koa.az05.bull.com MPIRUN_PORT=32945 MPIRUN_PROCESS
ES='koa:jatoba:' MPIRUN_RANK=1 MPIRUN_NPROCS=2 MPIRUN_ID=19842 
DISPLAY=dalbertpc.usnetwork.lan:0 /home/ib/mpi/tests/cpi/cpi
bash: line 1: 19866 Segmentation fault      /usr/bin/env MPIRUN_MPD=0 
MPIRUN_HOST=koa.az05.bull.com MPIRUN_PORT=32945 MPIRUN_PROCESS
ES='koa:jatoba:' MPIRUN_RANK=0 MPIRUN_NPROCS=2 MPIRUN_ID=19842 
DISPLAY=dalbertpc.usnetwork.lan:0 /home/ib/mpi/tests/cpi/cpi

The MVAPICH User Guide document talks about a script "make.mvapich.tcp" to 
build for running over IPoIB, but the script is not in the "mvapich-gen2" 
directory.  There is a script "make.mvapich2.tcp" under the 
"mvapich2-gen2" directory.  I will try building MVAPICH2 for TCP and see 
if that works.

        -Don Albert-
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20060329/694d1c03/attachment.html>


More information about the general mailing list