[openib-general] [PATCH] OFED 1.1-rc3 is ready

Woodruff, Robert J robert.j.woodruff at intel.com
Thu Sep 14 12:52:08 PDT 2006


Robert Walsh wrote, 
> 
> [woody at rkl-13 bin]$ ./ib_rdma_bw -n 10000 -t 1000 -s 2000000 rkl-12
> 4730: | port=18515 | ib_port=1 | size=2000000 | tx_depth=1000 |
> iters=10000 | duplex=0 | cma=0 |
> 4730: Local address:  LID 0x03, QPN 0x001d, PSN 0x9e070c RKey 0x2302400
> VAddr 0x00002a95dd3480
> 4730: Remote address: LID 0x04, QPN 0x001e, PSN 0x2bd6ba, RKey 0x2402500
> VAddr 0x00002a95c85480
> 4730:main: Completion with error at client:
> 4730:main: Failed status 9: wr_id 3
> 4730:main: scnt=7584, ccnt=6584
> [woody at rkl-13 bin]$  

Robert Walsh wrote,
>Hi Woody,
>When RC4 is available, there should be a patch in there that will fix
>this.  Can you let us know if you continue to see problems?

>Regards,
> Robert.

I installed RC5 and now the test just hangs:

[woody at rkl-13 bin]$ ./ib_rdma_bw -n 10000 -t 1000 -s 2000000 rkl-12
4702: | port=18515 | ib_port=1 | size=2000000 | tx_depth=1000 |
iters=10000 | duplex=0 | cma=0 |
4702: Local address:  LID 0x03, QPN 0x000d, PSN 0xf1b711 RKey 0x1101200
VAddr 0x00002a95dc8480
4702: Remote address: LID 0x04, QPN 0x000d, PSN 0xe62247, RKey 0x1101200
VAddr 0x00002a95c7c480
It hangs here and I have to Ctrl-C the test.


Intel MPI also fails, with:
# Barrier
[1][rdma_iba.c:260] Intel MPI fatal error: DTO operation completed with
error. status=0x8. cookie=0x514ee0
rank 1 in job 4  rkl-13_32779   caused collective abort of all ranks
  exit status of rank 1: killed by signal 9 

woody



