[openib-general] problems with lustre o2ib module & ofed

Thierry Delaitre delaitt at cpc.wmin.ac.uk
Mon Sep 25 01:12:01 PDT 2006


On Mon, 25 Sep 2006, Michael S. Tsirkin wrote:

> Quoting r. Thierry Delaitre <delaitt at cpc.wmin.ac.uk>:
> > Subject: Re: problems with lustre o2ib module & ofed
> >
> >
> > On Mon, 25 Sep 2006, Or Gerlitz wrote:
> >
> > > Jack Morgenstein wrote:
> > > > Did you recompile Lustre following the installation of ofed-1.1?
> > > > I'm not familiar with the Lustre installation procedure (i.e., if it
> > > > gets compiled on the current host).  If yes, you probably merely need
> > > > to uninstall and reinstall Lustre o2ib.
> > >
> > > OK, can we state clearly what's the user needs to do with modules
> > > directly dependent on ofed symbols (eg Lustre's o2ib, NFSoRDMA, RDS and
> > > hopefully more to come).
> > >
> > > Is it recompile / uninstall / install ???
> >
> > The issue is about the installation of Lustre 1.5.95 o2ib with OFED-1.1rc6
> > for SLES10.
> >
> > ofed-1.1-rc6 compiles nicely as shown below. The ib kernel modules all
> > resides under /lib/modules/2.6.16.21-0.8-smp/kernel/drivers/infiniband/
> > and do match the ones compiled by ofed. I have tried these steps several
> > times.
> >
> > n32:~ # lsmod | grep ib
> > libcfs                103060  1 lnet
> > ib_ucm                 19332  0
> > ib_addr                10756  1 rdma_cm
> > ib_cm                  31968  2 ib_ucm,rdma_cm
> > ib_ipoib               48144  0
> > ib_sa                  16652  3 rdma_cm,ib_cm,ib_ipoib
> > ib_uverbs              38312  2 rdma_ucm,ib_ucm
> > ib_umad                17968  0
> > ib_mthca              116240  0
> > ib_mad                 36116  4 ib_cm,ib_sa,ib_umad,ib_mthca
> > ib_core                49024  9
> > ib_ucm,rdma_cm,ib_cm,ib_ipoib,ib_sa,ib_uverbs,ib_umad,ib_mthca,ib_mad
> >
> > I compiled lustre for the above kernel and ofed installation. I get the
> > following when doing a 'lctl network up' in lustre. I have modversion set
> > to on in the kernel. If i set it to 'n' then i get a null pointer
> > exception and the module crashes.
> >
> > ko2iblnd: disagrees about version of symbol ib_create_cq
> > ko2iblnd: Unknown symbol ib_create_cq
>
> don't know anything about lustre, but note you must
> point build to pick up headers from
> /usr/local/ofed/src/openib/include/
> *before* the built-in header includes.

I've set the o2ib path to /usr/local/ofed/src/openib-1.1 as shown in the
lustre's configure line below. Lustre's configure script looks for a
driver/infiniband directory which only seems to exist under
/usr/local/ofed/src/openib-1.1

./configure --with-o2ib=/usr/local/ofed/src/openib-1.1/

Thierry.

> replace /usr/local/ofed with the prefix you specified.

> --
> MST




More information about the general mailing list