[openib-general] [PATCH ] RFC IB/cm do not track remote QPN in timewait state
Sean Hefty
mshefty at ichips.intel.com
Wed Aug 30 15:40:41 PDT 2006
Michael S. Tsirkin wrote:
> It exposed a race in SDP. The patch itself does not lead to crashes -
> I re-attach it here for reference.
> As we discussed, this needs to be extended to handle DREQ retries
> properly.
I've committed this patch to SVN 9193.
The CM should already handle DREQ retries properly; however...
The CM timeout for a response can end up being close to, or the same as, the
timewait time. (All of my test apps result in these values being the
same.) If a DREP is lost, the side that received the DREQ may enter and exit
timewait before the DREQ is retried. In this situation, each retried DREQ
finds no matching connection and gets dropped.
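
To make the timing concrete, here is a minimal sketch of the race as a toy timeline. It is not the CM code. The function name, the time units, and the simplifications are all assumptions made for illustration: zero network latency, only the first DREP is lost, the receiving side enters timewait at t=0, a DREQ that arrives while the receiver is still in timewait is answered with a fresh DREP, and a DREQ that arrives exactly as timewait expires is treated as dropped.

```python
def dreq_retry_outcomes(response_timeout, timewait, max_retries):
    """Toy model: fate of each retried DREQ after the first DREP is lost.

    All names and units here are hypothetical; this is not the ib_cm API.
    The DREQ receiver enters timewait at t=0 and leaves at t=timewait.
    The DREQ sender retries every `response_timeout` time units.
    """
    outcomes = []
    for attempt in range(1, max_retries + 1):
        arrival = attempt * response_timeout
        if arrival < timewait:
            # Receiver is still in timewait and can answer again (assumption).
            outcomes.append("drep_resent")
            break  # sender has its DREP and stops retrying
        # Receiver already left timewait: nothing matches the DREQ.
        outcomes.append("dropped")
    return outcomes


if __name__ == "__main__":
    # Equal timeouts, as in the test apps described above.
    print(dreq_retry_outcomes(response_timeout=4, timewait=4, max_retries=3))
    # Response timeout shorter than timewait.
    print(dreq_retry_outcomes(response_timeout=2, timewait=4, max_retries=3))
```

Under these assumptions, every retry is dropped once the response timeout is at least as long as the timewait time, while a shorter response timeout lets the first retry land inside the timewait window.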
We will want to queue this patch for 2.6.19, if you can point Roland to your git
tree.
Acked-by: Sean Hefty <sean.hefty at intel.com>