[ewg] Re: OFED 1.2 beta blocking bugs

Sean Hefty mshefty at ichips.intel.com
Thu Mar 8 11:13:49 PST 2007


>From a quick look at the code, it does look like there are some races
> in ipoib_multicast.c.  The place where a QP is actually attached to a
> group is essentially (trimming debug prints):
> 
> 		if (test_and_set_bit(IPOIB_MCAST_FLAG_ATTACHED, &mcast->flags))
> 			return 0;
> 
> 		ret = ipoib_mcast_attach(dev, be16_to_cpu(mcast->mcmember.mlid),
> 					 &mcast->mcmember.mgid);
> 
> and the place where a QP is detached is:
> 
> 	if (test_and_clear_bit(IPOIB_MCAST_FLAG_ATTACHED, &mcast->flags)) {
> 		ret = ipoib_mcast_detach(dev, be16_to_cpu(mcast->mcmember.mlid),
> 					 &mcast->mcmember.mgid);

Going back to 2.6.20 (pre-multicast changes), this area of the code looks like 
it has the same race.  Was IPoIB HA testing done on 2.6.20 or earlier versions 
of the code, and if so, were any issues found?  (I'm not sure we've found all of 
the problems yet.)

- Sean




More information about the ewg mailing list