[libfabric-users] Transport retry counter exceeded

Hefty, Sean sean.hefty at intel.com
Tue Sep 27 10:29:47 PDT 2016


> I've written an server/client using verbs provider. Randomly I get the
> error.
> 
> transport retry counter exceeded
> 
> 
> Sometimes the program works perfectly and sometimes it slows down and
> gives out the above error. Any particular reason this happens?

I would look at how quickly receive buffers are being re-posted.  If the receive queue gets overrun and receives are not re-posted, this could occur.  I'm assuming that there's not some hardware or link issue.

- Sean


More information about the Libfabric-users mailing list