[ofa-general] IPoIB connections falling off

Todd Bowman twbowman at gmail.com
Wed Jul 22 14:55:42 PDT 2009


I need a little direction to help solve an IPoIB issue.
Software: OFED 1.3 and 1.4 stacks, running OpenSM


Problem:
IPoIB connectivity fails: a node can no longer ping some or all of the other
IPoIB nodes.  IB itself is still up, and IB-level tests run successfully.  So
far the only fix is to restart the IB stack.  Cluster size seems to be
irrelevant; it has happened on clusters ranging from around 64 nodes to
thousands.
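
To tell the "all" case apart from the "some" case on an affected node, a
rough sketch like the following could help.  It assumes a Linux iputils
`ping` and that the peers' IPoIB addresses are passed on the command line;
the function names are made up for illustration and are not part of OFED.

```python
import subprocess
import sys
from typing import Callable, Dict, Iterable


def ping_once(host: str, timeout_s: int = 2) -> bool:
    """Send one ICMP echo request; True if a reply arrived (assumes iputils ping)."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def survey_peers(
    peers: Iterable[str], probe: Callable[[str], bool] = ping_once
) -> Dict[str, bool]:
    """Probe every peer once and record whether it answered."""
    return {peer: probe(peer) for peer in peers}


def summarize(results: Dict[str, bool]) -> str:
    """Classify the failure as none, partial, or total."""
    if not results:
        return "no peers given"
    unreachable = sorted(peer for peer, ok in results.items() if not ok)
    if not unreachable:
        return "all peers reachable"
    if len(unreachable) == len(results):
        return "all peers unreachable"
    return "some peers unreachable: " + ", ".join(unreachable)


if __name__ == "__main__":
    # Peers are the IPoIB addresses of the other nodes, given as arguments.
    peer_list = sys.argv[1:]
    if peer_list:
        print(summarize(survey_peers(peer_list)))
    else:
        print("usage: pass the IPoIB addresses of the peers to probe")
```

Running it on a failing node before restarting the IB stack would show
whether the unreachable peers are a stable subset or the whole cluster.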


My first instinct is that some information needed to create an IPoIB
connection has been lost from the SM/SA, but I'm not sure what that
information is or how to verify that it is gone.

Thanks in advance,

Todd