[openib-general] HPCC benchmark aborts at MPIRandomAccess test

David Costa David.Costa at Sun.COM
Fri Dec 1 14:20:31 PST 2006


Hello all,

I am running the HPCC benchmark on a Sun Blade 8000 blade server. I have 
two blades, running RHEL4U3 and SLESSP3 respectively, with 32 GBytes of 
memory each. The benchmark runs over a Sun-developed IB module that uses 
the Mellanox 25204 chips. When it reaches the MPIRandomAccess test, it 
fails immediately with the messages shown below.

Does anyone know what these messages mean and what the underlying cause 
might be? Please reply to me directly, as I am not subscribed to this list.

Thank you,

Dave Costa
david.costa at sun.com


[root@an1-bl0 ~]# mpirun_rsh -rsh -np 32 -hostfile /root/hostfile /usr/local/bin/hpcc
24 - MPI_CANCEL : Internal MPI error!
[24] [] Aborting Program!
mpirun_rsh: Abort signaled from [24]
26 - MPI_CANCEL : Internal MPI error!
[26] [] Aborting Program!
15 - MPI_CANCEL : Internal MPI error!
[15] [] Aborting Program!
18 - MPI_CANCEL : Internal MPI error!
[18] [] Aborting Program!
22 - MPI_CANCEL : Internal MPI error!
[22] [] Aborting Program!
4 - MPI_CANCEL : Internal MPI error!
[4] [] Aborting Program!
13 - MPI_CANCEL : Internal MPI error!
[13] [] Aborting Program!
11 - MPI_CANCEL : Internal MPI error!
16 - MPI_CANCEL : Internal MPI error!
[16] [] Aborting Program!
[11] [] Aborting Program!
28 - MPI_CANCEL : Internal MPI error!
[28] [] Aborting Program!
[19] Abort: [an1-bl1:19] Got completion with error, code=12
 at line 2365 in file viacheck.c
[23] Abort: [an1-bl1:23] Got completion with error, code=12
 at line 2365 in file viacheck.c
[17] Abort: [an1-bl1:17] Got completion with error, code=12
 at line 2365 in file viacheck.c
done.