[ofw] Re: Completion with bad status: IBV_WC_RETRY_EXC_ERROR
Diego Guella
diego.guella at sircomtech.com
Tue Nov 13 05:13:25 PST 2007
Hi Tzachi,
I attached the sources, as you requested. Please tell me if they are stripped out by mail spam filters.
The Linux code is full of "#ifdef WIN32", treat them as "#if 0" since that code is never meant to be compiled on Windows. I leave that here to have some reference to what I should do.
----- Original Message -----
From: Tzachi Dar
To: Diego Guella ; Fab Tillier
Cc: ofw at lists.openfabrics.org ; Dotan Barak
Sent: Tuesday, November 13, 2007 2:04 PM
Subject: RE: Completion with bad status: IBV_WC_RETRY_EXC_ERROR
Can you please send us your source code (Linux and windows)
----------------------------------------------------------------------------
From: Diego Guella [mailto:diego.guella at sircomtech.com]
Sent: Tuesday, November 13, 2007 2:45 PM
To: Tzachi Dar; Fab Tillier
Cc: ofw at lists.openfabrics.org
Subject: Re: Completion with bad status: IBV_WC_RETRY_EXC_ERROR
Hi Tzachi,
The error is on the "daemon" side (sender) at the polling of the cq, I get a completion work, but the status is not success.
The "client" (receiver) don't sees nothing, in the recv case none of the recv wr are used. In the rdma case, it is normal that the recv wr isn't used.
I am using either send/recv and rdma operations but the results are the same.
I am doing only one operation, the code is the example in WinIB 1.3, under Mellanox\WinIB\SDK\examples\hello_world
with some slight modifications on Windows to get RDMA working, and heavy modifications on Linux to use the Verbs interface instead of ib_al.
I can send you the sources if you want.
The same program works OK on Windows using the same machine as client/daemon. (I don't have 2 windows machines with Infiniband here)
----- Original Message -----
From: Tzachi Dar
To: Diego Guella ; Fab Tillier
Cc: ofw at lists.openfabrics.org
Sent: Tuesday, November 13, 2007 12:59 PM
Subject: RE: Completion with bad status: IBV_WC_RETRY_EXC_ERROR
Hi Diego,
Can you give us more information on where the error is?
Does your problem happen on the first packet that you send or only after some time?
Are you using send/recv or RDMA operations?
Thanks
Tzachi
------------------------------------------------------------------------
From: Diego Guella [mailto:diego.guella at sircomtech.com]
Sent: Tuesday, November 13, 2007 11:29 AM
To: Diego Guella; Tzachi Dar; Fab Tillier
Cc: ofw at lists.openfabrics.org
Subject: Completion with bad status: IBV_WC_RETRY_EXC_ERROR
Hi
I'm trying to port the source example in Mellanox\WinIB\SDK\examples\hello_world to Linux.
The problem I have is this: after I post the WR, I get completion with error:
status = 12 (IBV_WC_RETRY_EXC_ERROR) on Linux;
status = IB_WCS_TIMEOUT_RETRY_ERR on Windows.
What does this error means? I'm not sure about it.
Perhaps am I sending to the wrong LID? to the wrong QPN??
Any help is appreciated.
Thanks,
Diego
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ofw/attachments/20071113/6836e331/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: winib_1.3_tests.zip_
Type: application/octet-stream
Size: 121923 bytes
Desc: not available
URL: <http://lists.openfabrics.org/pipermail/ofw/attachments/20071113/6836e331/attachment.obj>
More information about the ofw
mailing list