Steve,<br>I get the following error message:<br><br><span style="color: rgb(0, 0, 255);">[0] Abort: [] Got completion with error 5, vendor code=a, dest rank=1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> at line 479 in file ibv_channel_manager.c</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> [1] Abort: ibv_post_recv err with 22 at line 1420 in file rdma_iba_priv.c</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> rank 1 in job 1 ammasso1_50414 caused collective abort of all ranks</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> exit status of rank 1: killed by signal 9 </span><br style="color: rgb(0, 0, 255);"><br><br>For details, please see the following:<br><span style="color: rgb(0, 0, 255);">[root@ammasso1 0.9.8-RELEASE]# vi /etc/hosts</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0,
255);">[root@ammasso1 0.9.8-RELEASE]# cd bin</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">[root@ammasso1 bin]# ./mpdboot -n 2</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: starting</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">mpdroot: perror msg: Connection refused</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">running mpdallexit on ammasso1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">LAUNCHED mpd on ammasso1 via </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: launch cmd= /root/0.9.8-RELEASE/bin/mpd.py --ncpus=1 -e -d</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: mpd on ammasso1 on port 50414</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">RUNNING: mpd on
ammasso1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: info for running mpd: {'ncpus': 1, 'list_port': 50414, 'entry_port': '', 'host': 'ammasso1', 'entry_host': '', 'ifhn': ''}</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">LAUNCHED mpd on ammasso2 via ammasso1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: launch cmd= ssh -x -n ammasso2.</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> '/root/0.9.8-RELEASE/bin/mpd.py -h ammasso1 -p 50414 --ncpus=1 -e -d' </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">root@ammasso2.'s password: </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: mpd on ammasso2 on port 59327</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">RUNNING: mpd on ammasso2</span><br style="color:
rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">debug: info for running mpd: {'entry_port': 50414, 'ncpus': 1, 'list_port': 59327, 'pid': 2997, 'host': 'ammasso2., 'entry_host': 'ammasso1', 'ifhn': ''}</span><br style="color: rgb(0, 0, 255);"><br style="color: rgb(0, 0, 255);"><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">[root@ammasso1 bin]# ./mpiexec -n 2 /root/IMB_2.3/src/IMB-MPI1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">secretword=</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#---------------------------------------------------</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Intel (R) MPI Benchmark Suite V2.3, MPI-1 part </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#---------------------------------------------------</span><br style="color: rgb(0, 0, 255);"><span
style="color: rgb(0, 0, 255);"># Date : Wed Dec 6 13:25:59 2006</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Machine : i686</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># System : Linux</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Release : 2.6.17.13</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Version : #1 SMP Wed Nov 8 17:34:14 PST 2006</span><br style="color: rgb(0, 0, 255);"><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Minimum message length in bytes: 0</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Maximum message length in bytes: 4194304</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#</span><br
style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># MPI_Datatype : MPI_BYTE </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># MPI_Datatype for reductions : MPI_FLOAT</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># MPI_Op : MPI_SUM </span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">#</span><br style="color: rgb(0, 0, 255);"><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># List of Benchmarks to run:</span><br style="color: rgb(0, 0, 255);"><br style="color:
rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># PingPong</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># PingPing</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Sendrecv</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Exchange</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Allreduce</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Reduce</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Reduce_scatter</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Allgather</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Allgatherv</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Alltoall</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Bcast</span><br style="color:
rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"># Barrier</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">recv desc error, 128</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">[0] Abort: [] Got completion with error 5, vendor code=a, dest rank=1</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> at line 479 in file ibv_channel_manager.c</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">[1] Abort: ibv_post_recv err with 22 at line 1420 in file rdma_iba_priv.c</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);">rank 1 in job 1 ammasso1_50414 caused collective abort of all ranks</span><br style="color: rgb(0, 0, 255);"><span style="color: rgb(0, 0, 255);"> exit status of rank 1: killed by signal 9 </span><br style="color: rgb(0, 0, 255);"><br>David<br><br><br><br><br><b><i>Steve Wise
&lt;swise@opengridcomputing.com&gt;</i></b> wrote:<blockquote class="replbq" style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px;"> On Wed, 2006-12-06 at 11:17 -0800, david elsen wrote:<br>> Steve,<br>> <br>> Thanks a lot for the reply. <br>> <br>> I could run the cpi from the example directory. <br>> <br>> But I see some error message when trying to run the IMB-MPI1. I am<br>> using 219297_IMB_2.3. Which version are you using?<br><br>I'm running the same release.<br><br>Steve.<br><br></blockquote><br><p>