[libfabric-users] Transport retry counter exceeded
Hefty, Sean
sean.hefty at intel.com
Tue Sep 27 10:29:47 PDT 2016
> I've written an server/client using verbs provider. Randomly I get the
> error.
>
> transport retry counter exceeded
>
>
> Sometimes the program works perfectly and sometimes it slows down and
> gives out the above error. Any particular reason this happens?
I would look at how quickly receive buffers are being re-posted. If the receive queue gets overrun and receives are not re-posted, this could occur. I'm assuming that there's not some hardware or link issue.
- Sean
More information about the Libfabric-users
mailing list