[openib-general] RNR_RETRY_EXC_ERR and completion opcode in "send_lat"

Dotan Barak dotanb at dev.mellanox.co.il
Sat Dec 2 22:34:32 PST 2006


Hi Sayantan.
Sayantan Sur wrote:

>Hi,
>
>I have a question about the "status" field for a completion which is due
>to RNR retry exceeded error. I trivially modified the `send_lat' program
>(from the Gen2 perftest directory) to use SRQ and not post receives
>after some specified time. Given the "rnr_retry" attribute of the QP not
>to be 7 (infinite retry), I'm expecting the sender to get an erroneous
>completion with IBV_WC_RNR_RETRY_EXC_ERR.
>
>So far so good ... however, the completion I pull out of the send_cq,
>lists the opcode of the completion to be IBV_WC_RECV! Is this expected?
>
>I am using OFED 1.1 on dual Intel Xeon machines with Mellanox DDR HCAs
>(two ports) and in MemFree mode. The distribution used is RH AS4 (Nahant
>Update 3), with kernel version 2.6.17.7.
>
>If someone could explain this behavior, or suggest a workaround, it'd be
>great.
>
>TIA,
>Sayantan.
>  
>
I took the following text from the man pages that I wrote for libibverbs:
"Not all wc attributes are always valid. If the completion status is
other than IBV_WC_SUCCESS, only the following attributes are valid:
wr_id, status, qp_num, and vendor_err."

In other words, the opcode is not valid when the completion has an error
status, so the IBV_WC_RECV value you saw should simply be ignored.
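
To illustrate the rule quoted above, here is a minimal sketch of how the
completion handling could be ordered: check the status first, and look at
the opcode only on success. The demo_* types, names, and numeric values are
hypothetical stand-ins so that the sketch compiles without libibverbs; real
code would include <infiniband/verbs.h> and use struct ibv_wc,
IBV_WC_SUCCESS, and the entries returned by ibv_poll_cq().

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins for the libibverbs definitions. The enum values
 * are placeholders and do not match the library's numeric values. */
enum demo_wc_status { DEMO_WC_SUCCESS, DEMO_WC_RNR_RETRY_EXC_ERR };
enum demo_wc_opcode { DEMO_WC_SEND, DEMO_WC_RECV };

/* Mirrors only the struct ibv_wc fields discussed in this thread. */
struct demo_wc {
    uint64_t            wr_id;
    enum demo_wc_status status;
    enum demo_wc_opcode opcode;     /* meaningful only on success */
    uint32_t            vendor_err;
    uint32_t            qp_num;
};

enum demo_action { DEMO_SEND_DONE, DEMO_RECV_DONE, DEMO_FAILED };

/* Classify one completion. The opcode is read only after the status has
 * been confirmed to be success; on error, only wr_id, status, qp_num and
 * vendor_err should be consulted. */
enum demo_action demo_classify(const struct demo_wc *wc)
{
    assert(wc != NULL);

    if (wc->status != DEMO_WC_SUCCESS)
        return DEMO_FAILED;         /* opcode deliberately not inspected */

    return wc->opcode == DEMO_WC_SEND ? DEMO_SEND_DONE : DEMO_RECV_DONE;
}
```

With this ordering, a failed send whose opcode field happens to read as a
receive is still reported as a failure. To find out which operation failed,
the wr_id is the field to rely on, for example by tagging each work request
with its own identifier when it is posted.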

Thanks
Dotan



