[openib-general] [PATCH 3/3] uDAPL cma: add support for address and route retries, call disconnect when recving dreq

Or Gerlitz ogerlitz at voltaire.com
Thu Nov 2 03:17:11 PST 2006


Arlin Davis wrote:
> Fix some timeout and long disconnect delay issues discovered during 
> scale-out testing. Added support to retry rdma_cm address and route 
> resolution with configuration options and provide a disconnect call when 
> receiving the disconnect request to force an immediate disconnect reply 
> to the remote side.

Can be very nice if you share with the community the IB stack issues 
revealed under scale-out testing... basically what was the testbed?

 From what the patch does I understand you attempt to handle timeout on 
address and route resolution and long disconnect delay.

Was the issue with address resolution being ARP request or reply 
messages getting lost?

Was the issue with route resolution being timeout on SA Path queries?

Please note that for the first two, you want to retry if the event 
status is -ETIMEDOUT, the patch ignores the status field.

Was the issue with disconnect delay that peer A called 
dat_ep_disconnect() (ie sending DREQ) and the DREP was sent only when 
peer B got the disconnect event and called dat_ep_disconnect()? so now 
the DREP is sent from within the provider code when it gets the DREQ?

Or.





More information about the general mailing list