[ofa-general] ofed1.2.5rc2 and intel mpi error

Mahmoud Hanafi mhanafi at csc.com
Fri Feb 1 13:17:08 PST 2008


Thanks for the tips. I got past the errors using the I_MPI_RDMA_RNDV_WRITE 
setting. But now I get the following error 


unexpected DAPL event 4008 from 111:n29
.
.
.

Any ideas?

I am using Cisco HCA's
MT25208 Tavor Compat, DLGL revision A0
FW v4.7.6 build 3.2.0.118

Mahmoud Hanafi
Sr. System Administrator
CSC HPC COE
Bld. 676
2435 Fifth Street
WPAFB, Ohio 45433
(937) 255-1536


Computer Sciences Corporation 
Registered Office: 2100 East Grand Avenue, El Segundo California 90245, 
USA
Registered in USA No: C-489-59

----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
This is a PRIVATE message. If you are not the intended recipient, please 
delete without copying and kindly advise us by e-mail of the mistake in 
delivery. 
NOTE: Regardless of content, this e-mail shall not operate to bind CSC to 
any order or other contract unless pursuant to explicit written agreement 
or government initiative expressly permitting the use of e-mail for such 
purpose.
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------




Arlin Davis <ardavis at ichips.intel.com> 
Sent by: general-bounces at lists.openfabrics.org
02/01/2008 12:58 PM

To
Mahmoud Hanafi/DEF/CSC at CSC
cc
general-bounces at lists.openfabrics.org, general at lists.openfabrics.org
Subject
Re: [ofa-general] ofed1.2.5rc2 and intel mpi error






Mahmoud Hanafi wrote:
> 
> here is my mpirun command
> 
> mpirun -np 128 -env I_MPI_DEVICE rdma:OpenIB-cma -env I_MPI_DEBUG 2 
> /home/hanafim/HPL/xhpl
> 

What IB adapter are you using and what F/W version (ibstat)?

This might be related to following bug with large RDMA reads: 
https://bugs.openfabrics.org//show_bug.cgi?id=736

Try the following:

Set "-env I_MPI_RDMA_MAX_MSG_SIZE 4194304" to workaround the issue. If 
it does not help, set I_MPI_RDMA_RNDV_WRITE to enable to use RDMA Write 
rendezvous protocol instead of the default RDMA Read.

-arlin

_______________________________________________
general mailing list
general at lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit 
http://openib.org/mailman/listinfo/openib-general

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20080201/2256deab/attachment.html>


More information about the general mailing list