[ofa-general] IPOIB/CM increase retry counts

Eli Cohen eli at dev.mellanox.co.il
Tue Feb 12 12:36:55 PST 2008


Sean Hefty wrote:
>> Saying all that, I don't think we want to have --any RNR retries--, as
>> for retries, I am open to hear what others think.
> 
> I'm really not all that familiar with ipoib protocol, but if it's being
> implemented over an RC connection, then adding an RNR retry seems to make sense
> to me.  I believe using UC is better, but if it's over RC, I don't know that we
> want to take the hit of tearing down and re-establishing the connection just
> because we have a fast sender.  (This is just an opinion based on no fact
> whatsoever.)
> 

I don't see why setting rnr retry count can help if we have a fast sender. If 
this sender is faster than the receiver eventually the rnr counter will expire
and the connection will close.

As for retry count, I don't know how common are errors that contribute to the 
retry counter. If anyone has statistics of this I'd be glad to know.

Pradeep, can you tell identify what part of the patch you sent actually solved 
the problem you were seeing and also give some description of the problem?



More information about the general mailing list