[openib-general] 3513 DAPL is Broken

James Lentini jlentini at netapp.com
Thu Sep 22 09:09:39 PDT 2005


Is there a way to reproduce this with dapltest? 

Our 6 canonical regression tests (see 
test/dapltest/scripts/regress.sh), don't encounter this problem on 
revision 3521.

Are you sure the application is hanging in DAPL? Can you enable DAPL 
debugging and send the output (see doc/dapl_environ.txt and 
doc/dat_environ.txt)?

Thanks,
james

On Wed, 21 Sep 2005, Woodruff, Robert J wrote:

> Seems to hang around the time of the modify QP.
> 
> ibv_rc_pingpong seems to work OK and also your 
> DAPL-socket CM version that you gave me yesterday seems 
> to work, but the DAPL I pulled from SVN that uses the IB AT/CM
> has the following problem. 
> 
> I am starting to think that pushing out your socket CM
> version until things stabilize with the IBAT/IBCM version
> might be worth considering, so that people that
> want to use DAPL now have something that is reliable.
> 
> woody
> 
> Here is the dapl trace when running Intel MPI on top of uDAPL 3513, 
> 
> dapl_ia_query (0x522860, (nil), 0x0, (nil), 0x3ffffff, 0x7fbfffe510)
> dapl_ia_query () returns 0x0
> dapl_evd_create () returns 0x0
>  setup_listener(ia_ptr 0x522860 SID 3545 sp 0x5238e0 conn 0x5239a0 id
> 5389248)
>  setup_listener(conn=0x5239a0 cm_id=5389248)
> dapl_ep_create (0x522860, 0x5235a0, 0x523620, 0x523620, 0x523780,
> 0x7fbfffecb0, 0x5201e8)
>  query_hca: MAX msg 2147483648 dto 65535 iov 59 rdma i4,o4
>  qp_alloc: ia_ptr 0x522860 ep_ptr 0x526740 ep_ctx_ptr 0x526740
>  qp_alloc: qpn 0xc0409 sq 1000,9 rq 1000,1
>  modify_qp: qp 0x523c50, state 1 qp_num 0xc0409
> dapl_ep_create (0x522860, 0x5235a0, 0x523620, 0x523620, 0x523780,
> 0x7fbfffecb0, 0x5203b8)
>  query_hca: MAX msg 2147483648 dto 65535 iov 59 rdma i4,o4
>  qp_alloc: ia_ptr 0x522860 ep_ptr 0x526a20 ep_ctx_ptr 0x526a20
>  qp_alloc: qpn 0xc040a sq 1000,9 rq 1000,1
>  modify_qp: qp 0x526d00, state 1 qp_num 0xc040a
> dapl_ep_create (0x522860, 0x5235a0, 0x523620, 0x523620, 0x523780,
> 0x7fbfffecb0, 0x520758)
>  query_hca: MAX msg 2147483648 dto 65535 iov 59 rdma i4,o4
>  qp_alloc: ia_ptr 0x522860 ep_ptr 0x593470 ep_ctx_ptr 0x593470
>  qp_alloc: qpn 0xc040b sq 1000,9 rq 1000,1
>  modify_qp: qp 0x526e40, state 1 qp_num 0xc040b



More information about the general mailing list