[openib-general] IPoIB still not working

Woodruff, Robert J robert.j.woodruff at intel.com
Tue Dec 7 17:11:37 PST 2004


 
Here are some log files.

The first file, mcast-64.log, is the /var/log/messages output 
from the patch you sent, run on the 64-bit system.

The next log file, osm-64bit.log, is the opensm log file.

The next log file, osm-32bit.log, is the opensm log file from
running on the 32-bit node.


In the passing case, ipoib sends 2 MCM messages and opensm has no
complaints.
Search for MCMember Record in osm-32bit.log.

In the failing case, ipoib sends 2 MCM messages that look similar, with
no errors reported. However, in the failing case ipoib continues to send
MCM messages that opensm rejects. There are a couple of differences in
the failing case. First, the lower 32 bits of the MGID appear to be
0xffffffff in the passing case and something else when it fails.
Second, it appears that opensm may be rejecting the messages because of
a bug where the scope and join fields are reversed when extracted from
the MAD. In the passing case, since the lower 32 bits of the MGID are
0xffffffff, you never get to the code that checks the join member.
Someone who understands opensm should look at this, but Sean,
I think it may be wrong.
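
To make the two checks above concrete, here is a minimal C sketch. It is
not taken from opensm or ipoib. The helper names mgid_low32 and
split_scope_join are hypothetical, and the nibble layout (scope in the
high 4 bits, join state in the low 4 bits of one byte) is my assumption
about how the MCMemberRecord packs those two fields. If the two
nibbles were swapped on extraction, a byte such as 0x21 would decode as
scope 1, join state 2 instead of scope 2, join state 1.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* A GID is 16 bytes, carried in network (big-endian) byte order. */
#define GID_LEN 16

/*
 * Hypothetical helper: return the lower 32 bits of an MGID, i.e. raw
 * bytes 12..15. The value is assembled byte by byte so the result is
 * the same on 32-bit and 64-bit hosts, whatever their endianness.
 */
static uint32_t mgid_low32(const uint8_t mgid[GID_LEN])
{
    assert(mgid != NULL);
    return ((uint32_t)mgid[12] << 24) |
           ((uint32_t)mgid[13] << 16) |
           ((uint32_t)mgid[14] << 8)  |
            (uint32_t)mgid[15];
}

/*
 * Hypothetical helper: split the combined scope/join-state byte.
 * ASSUMPTION: scope is the high nibble and join state is the low
 * nibble. Reversing the two shifts/masks below reproduces the kind
 * of swapped-field bug suspected in the text.
 */
static void split_scope_join(uint8_t scope_state,
                             uint8_t *scope, uint8_t *join_state)
{
    assert(scope != NULL && join_state != NULL);
    *scope = (uint8_t)(scope_state >> 4);
    *join_state = (uint8_t)(scope_state & 0x0f);
}
```

Comparing mgid_low32() against 0xffffffff for each MCMember Record in the
two opensm logs would show which requests take the path that skips the
join-state check and which do not.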

This, however, does not explain why, in the failing case, ipoib
continues to try to join the mcast group, unless it runs into
difficulties after trying to join the group and decides to retry,
with the subsequent retries then being failed by opensm.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: osm-32bit.log
Type: application/octet-stream
Size: 2781897 bytes
Desc: osm-32bit.log
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20041207/365a3d06/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: osm-64bit.log
Type: application/octet-stream
Size: 387066 bytes
Desc: osm-64bit.log
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20041207/365a3d06/attachment-0001.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mcast-64.log
Type: application/octet-stream
Size: 50359 bytes
Desc: mcast-64.log
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20041207/365a3d06/attachment-0002.obj>

