[openib-general] uDAPL open HCA problem

Sayantan Sur surs at cse.ohio-state.edu
Tue Oct 25 13:58:03 PDT 2005


* On Oct,10 James Lentini<jlentini at netapp.com> wrote :
> 
> 
> On Fri, 21 Oct 2005, LEI CHAI wrote:
> 
> >  ips_by_gid: RET 0 at_rec 0x7fffffa8d380 -> id 4627
> >  dapli_at_event_cb()
> >  ip_comp_handler: rec 0x7fffffa8d380 ->id 4627 id 4627 num -22 3c66c000
> >  ip_comp_handler: resolution err -22 retry 1
> >  ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4628
> >  dapli_at_event_cb()
> >  ip_comp_handler: rec 0x7fffffa8d380 ->id 4628 id 4628 num -22 0
> >  ip_comp_handler: resolution err -22 retry 2
> > [rdma_udapl_priv.c:640] error(262144): Cannot open IA
> >  ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4629
> >  dapli_at_event_cb()
> >  ip_comp_handler: rec 0x7fffffa8d380 ->id 4629 id 4629 num -22 0
> >  ip_comp_handler: resolution err -22 retry 3
> >  ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4630
> >  dapli_at_event_cb()
> >  ip_comp_handler: rec 0x7fffffa8d380 ->id 4630 id 4630 num -22 0
> >  ip_comp_handler: resolution err -22 retry 4
> >  ip_comp_handler: ERR: at_rec 0x7fffffa8d380, id 4630 num -22
> >  open_hca: ERR ib_at_ips_by_gid for mthca0
> 
> ib_at_ips_by_gid is failing again. Have you setup an IPoIB address?  

Sorry for the late reply :-( Yes, we have IPoIB setup. This happens
intermittently. As suggested by Woody, we will also try out the scm
version.

Thanks,
Sayantan.

=====

ib0       Link encap:UNSPEC  HWaddr
00-00-04-04-FE-80-00-00-00-00-00-00-00-00-00-00  
          inet addr:150.1.110.4  Bcast:150.1.255.255  Mask:255.255.0.0
          inet6 addr: fe80::202:c902:40:315/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:2044  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128 
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)


lsmod | grep ^ib

[surs at ro1:~] lsmod | grep ^ib
ib_ipoib               48008  0 
ib_uat                 14840  0 
ib_at                  25696  1 ib_uat
ib_sa                  17804  2 ib_ipoib,ib_at
ib_ucm                 22280  0 
ib_cm                  37744  1 ib_ucm
ib_uverbs              35992  0 
ib_umad                18208  0 
ib_mthca              122656  0 
ib_mad                 44072  4 ib_sa,ib_cm,ib_umad,ib_mthca
ib_core                56192  8
ib_ipoib,ib_sa,ib_ucm,ib_cm,ib_uverbs,ib_umad,ib_mthca,ib_mad



Error:

[chail at ro1:osu_benchmarks] ../bin/mpiexec -n 2 ./a.out
DAPL: NOT Setting Loopback
 dapl_ib_init:
 ib_thread_init(7629)
dapl_ia_open (ib0, 8, 0x7fffffe20a18, 0x5ae728)
 open_hca: mthca0 - 0x5c4390
 ib_thread(7629,0x40200960): ENTER: pipe 8 at 4
 open_hca: Found dev mthca0 0002c90200400314
 open_hca: GID subnet fe80000000000000 id 0002c90200400315
 ips_by_gid: RET 0 at_rec 0x7fffffe20780 -> id 37
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe20780 ->id 37 id 37 num -22 3c77c000
 ip_comp_handler: resolution err -22 retry 1
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 38
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe20780 ->id 38 id 38 num -22 0
 ip_comp_handler: resolution err -22 retry 2
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 39
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe20780 ->id 39 id 39 num -22 0
 ip_comp_handler: resolution err -22 retry 3
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 40
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe20780 ->id 40 id 40 num -22 0
 ip_comp_handler: resolution err -22 retry 4
 ip_comp_handler: ERR: at_rec 0x7fffffe20780, id 40 num -22
 open_hca: ERR ib_at_ips_by_gid for mthca0
dapls_ib_open_hca failed 40000
dapl_ia_open () returns 0x40000
DAPL: Stopped (dapl_fini)
 dapl_ib_release:
 ib_thread_destroy(7629)
 ib_thread_destroy: waiting for ib_thread
 ib_thread(7629) EXIT
[rdma_udapl_priv.c:640] error(262144): Cannot open IA
DAPL: NOT Setting Loopback
 dapl_ib_init:
 ib_thread_init(7630)
dapl_ia_open (ib0, 8, 0x7fffffe55578, 0x5ae728)
 open_hca: mthca0 - 0x5c4390
 ib_thread(7630,0x40200960): ENTER: pipe 8 at 4
 open_hca: Found dev mthca0 0002c90200400314
 open_hca: GID subnet fe80000000000000 id 0002c90200400315
 ips_by_gid: RET 0 at_rec 0x7fffffe552e0 -> id 41
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe552e0 ->id 41 id 41 num -22 3c77c000
 ip_comp_handler: resolution err -22 retry 1
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 42
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe552e0 ->id 42 id 42 num -22 0
 ip_comp_handler: resolution err -22 retry 2
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 43
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe552e0 ->id 43 id 43 num -22 0
 ip_comp_handler: resolution err -22 retry 3
 ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 44
 dapli_at_event_cb()
 ip_comp_handler: rec 0x7fffffe552e0 ->id 44 id 44 num -22 0
 ip_comp_handler: resolution err -22 retry 4
 ip_comp_handler: ERR: at_rec 0x7fffffe552e0, id 44 num -22
[rdma_udapl_priv.c:640] error(262144): Cannot open IA
 open_hca: ERR ib_at_ips_by_gid for mthca0
dapls_ib_open_hca failed 40000
dapl_ia_open () returns 0x40000


> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

-- 
http://www.cse.ohio-state.edu/~surs



More information about the general mailing list