[openib-general] uDAPL open HCA problem
Sayantan Sur
surs at cse.ohio-state.edu
Tue Oct 25 13:58:03 PDT 2005
* On Oct,10 James Lentini<jlentini at netapp.com> wrote :
>
>
> On Fri, 21 Oct 2005, LEI CHAI wrote:
>
> > ips_by_gid: RET 0 at_rec 0x7fffffa8d380 -> id 4627
> > dapli_at_event_cb()
> > ip_comp_handler: rec 0x7fffffa8d380 ->id 4627 id 4627 num -22 3c66c000
> > ip_comp_handler: resolution err -22 retry 1
> > ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4628
> > dapli_at_event_cb()
> > ip_comp_handler: rec 0x7fffffa8d380 ->id 4628 id 4628 num -22 0
> > ip_comp_handler: resolution err -22 retry 2
> > [rdma_udapl_priv.c:640] error(262144): Cannot open IA
> > ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4629
> > dapli_at_event_cb()
> > ip_comp_handler: rec 0x7fffffa8d380 ->id 4629 id 4629 num -22 0
> > ip_comp_handler: resolution err -22 retry 3
> > ip_comp_handler: ips_by_gid 0 rec 0x7fffffa8d380->id 4630
> > dapli_at_event_cb()
> > ip_comp_handler: rec 0x7fffffa8d380 ->id 4630 id 4630 num -22 0
> > ip_comp_handler: resolution err -22 retry 4
> > ip_comp_handler: ERR: at_rec 0x7fffffa8d380, id 4630 num -22
> > open_hca: ERR ib_at_ips_by_gid for mthca0
>
> ib_at_ips_by_gid is failing again. Have you setup an IPoIB address?
Sorry for the late reply :-( Yes, we have IPoIB setup. This happens
intermittently. As suggested by Woody, we will also try out the scm
version.
Thanks,
Sayantan.
=====
ib0 Link encap:UNSPEC HWaddr
00-00-04-04-FE-80-00-00-00-00-00-00-00-00-00-00
inet addr:150.1.110.4 Bcast:150.1.255.255 Mask:255.255.0.0
inet6 addr: fe80::202:c902:40:315/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:2044 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:128
RX bytes:0 (0.0 b) TX bytes:0 (0.0 b)
lsmod | grep ^ib
[surs at ro1:~] lsmod | grep ^ib
ib_ipoib 48008 0
ib_uat 14840 0
ib_at 25696 1 ib_uat
ib_sa 17804 2 ib_ipoib,ib_at
ib_ucm 22280 0
ib_cm 37744 1 ib_ucm
ib_uverbs 35992 0
ib_umad 18208 0
ib_mthca 122656 0
ib_mad 44072 4 ib_sa,ib_cm,ib_umad,ib_mthca
ib_core 56192 8
ib_ipoib,ib_sa,ib_ucm,ib_cm,ib_uverbs,ib_umad,ib_mthca,ib_mad
Error:
[chail at ro1:osu_benchmarks] ../bin/mpiexec -n 2 ./a.out
DAPL: NOT Setting Loopback
dapl_ib_init:
ib_thread_init(7629)
dapl_ia_open (ib0, 8, 0x7fffffe20a18, 0x5ae728)
open_hca: mthca0 - 0x5c4390
ib_thread(7629,0x40200960): ENTER: pipe 8 at 4
open_hca: Found dev mthca0 0002c90200400314
open_hca: GID subnet fe80000000000000 id 0002c90200400315
ips_by_gid: RET 0 at_rec 0x7fffffe20780 -> id 37
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe20780 ->id 37 id 37 num -22 3c77c000
ip_comp_handler: resolution err -22 retry 1
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 38
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe20780 ->id 38 id 38 num -22 0
ip_comp_handler: resolution err -22 retry 2
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 39
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe20780 ->id 39 id 39 num -22 0
ip_comp_handler: resolution err -22 retry 3
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe20780->id 40
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe20780 ->id 40 id 40 num -22 0
ip_comp_handler: resolution err -22 retry 4
ip_comp_handler: ERR: at_rec 0x7fffffe20780, id 40 num -22
open_hca: ERR ib_at_ips_by_gid for mthca0
dapls_ib_open_hca failed 40000
dapl_ia_open () returns 0x40000
DAPL: Stopped (dapl_fini)
dapl_ib_release:
ib_thread_destroy(7629)
ib_thread_destroy: waiting for ib_thread
ib_thread(7629) EXIT
[rdma_udapl_priv.c:640] error(262144): Cannot open IA
DAPL: NOT Setting Loopback
dapl_ib_init:
ib_thread_init(7630)
dapl_ia_open (ib0, 8, 0x7fffffe55578, 0x5ae728)
open_hca: mthca0 - 0x5c4390
ib_thread(7630,0x40200960): ENTER: pipe 8 at 4
open_hca: Found dev mthca0 0002c90200400314
open_hca: GID subnet fe80000000000000 id 0002c90200400315
ips_by_gid: RET 0 at_rec 0x7fffffe552e0 -> id 41
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe552e0 ->id 41 id 41 num -22 3c77c000
ip_comp_handler: resolution err -22 retry 1
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 42
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe552e0 ->id 42 id 42 num -22 0
ip_comp_handler: resolution err -22 retry 2
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 43
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe552e0 ->id 43 id 43 num -22 0
ip_comp_handler: resolution err -22 retry 3
ip_comp_handler: ips_by_gid 0 rec 0x7fffffe552e0->id 44
dapli_at_event_cb()
ip_comp_handler: rec 0x7fffffe552e0 ->id 44 id 44 num -22 0
ip_comp_handler: resolution err -22 retry 4
ip_comp_handler: ERR: at_rec 0x7fffffe552e0, id 44 num -22
[rdma_udapl_priv.c:640] error(262144): Cannot open IA
open_hca: ERR ib_at_ips_by_gid for mthca0
dapls_ib_open_hca failed 40000
dapl_ia_open () returns 0x40000
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
>
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
--
http://www.cse.ohio-state.edu/~surs
More information about the general
mailing list