[openib-general] mpirun_mpd crashing

Liang Peng Liang.Peng at Sun.COM
Tue May 16 21:44:26 PDT 2006


Hi there,

Not sure whether this is the proper place to post, but we encounter some 
mpirun_mpd crashing problems in testing Voltaire MPI (based on MVAPICH) 
with Sun studio 11 compilers on SuSE Linux 9 SP3 (Opteron).  Hope 
someone can provide some hints:


MVAPICH version: 0.9.4 with Voltaire's modifications
Compiler used: Sun Studio 11
Problem:

When using the mpd version of MVAPICH, mpirun crashes with the following:

 > mpirun_mpd -np 2 /usr/voltaire/mpi.cc.mpd/bin/cpi
[man_0]: [cli_0]: client_bnr_get failed
[cli_1]: MPD_Man_msg_handler received unexpected msg 
:cmd=client_bnr_get_output val=apstc-g4:00024400:
:
handle_lhs_msgs_input: failed for bnr_get: buf=:cmd=bnr_get src=man_0 
dest=man_0 bcast=true attr=MVAPICH_0001\^ gid=0
:
[man_0]: application program exited abnormally with status 0
[man_0]: application program signaled with signal 11 (: Segmentation fault)

The "rsh" version is working properly, and the gcc compiled version of 
mpd is working on the same machine.

Thanks!


Regards, 
Liang Peng

-- 
Research Scientist
Large Scale Computing
Asia Pacific Science & Technology Center
Sun Microsystems, Inc. 
and
Nanyang Technological University, Singapore





More information about the general mailing list