[ewg] Re: [ofa-general] soft lockup in the kernel mad layer

Jack Morgenstein jackm at dev.mellanox.co.il
Tue Jul 1 05:40:33 PDT 2008


On Tuesday 01 July 2008 12:44, Or Gerlitz wrote:
> Or Gerlitz wrote:
> > While doing some tests against nodes with new HCA firmware (ConnectX FW 2.5), which seems to be very slow in responding to node info queries, I think I have hit one or more bugs in the kernel MAD code. The IB bits used on this node are not the mainline kernel ones but rather
> > git://git.openfabrics.org/ofed_1_3/linux-2.6.git ofed_kernel
> > commit 564e9e9383272f4311fd87ff4e5447cfcebad73a
> >
> Jack, Vlad
> 
> Looking now at the ofed_1_3/linux-2.6.git tree, I don't see the commit
> below there, am I correct?
> 
> Is it because the fix went into the kernel after the OFED 1.3 "feature
> freeze" and was never pulled into OFED, since you don't pick up every
> fix that enters the kernel during an OFED cycle?
> 
> Or.
> > commit b61d92d8ae6aa13b17d1c31e69d123879cec2ee2
> > Author: Sean Hefty <sean.hefty at intel.com>
> > Date:   Fri Nov 30 17:30:18 2007 -0800
> >
> >     IB/mad: Fix incorrect access to items on local_list
> >     

You are correct. We probably missed this patch because it came so close to the OFED 1.3 release, which was based on kernel 2.6.24.
It is already in the OFED 1.4 tree. I'll add it to the OFED 1.3 tree, so that it will be included in any future OFED 1.3 releases.
(Note that Roland asked Linus to pull this patch for the kernel 2.6.25 tree on Jan 25, 2008:
http://lists.openfabrics.org/pipermail/general/2008-January/045492.html)

- Jack


