[openib-general] Re: ib0: ipoib_ib_post_receive failed for buf 111 ib0: failed to allocate receive buffer
Roland Dreier
rolandd at cisco.com
Thu Oct 13 17:54:36 PDT 2005
Helen> Not in realtime. My observations were made after the fact.
Helen> I supose I can launch another test and watch the cunter in
Helen> realtime if you believe that is necessary?
That might be interesting.
Assuming the HCA continues to work fine, and IPoIB recovers, the only
theory I can come up is that something is causing interrupts to be
held off for a long time, so the IPoIB driver doesn't get to see sends
completing. But I don't know what such a workload might be. Perhaps
something else you're running (Lustre?, iSCSI?) holds a lock for a
long time and causes the timeout. But it's not clear to me why the TX
watchdog would get to run if the interrupt handler doesn't get to run.
- R.
More information about the general
mailing list