[openib-general] IB initialization

Chris Worley worleys at gmail.com
Fri Apr 14 09:05:06 PDT 2006


Hal,

Note that I got an /etc/init.d/openibd script that's getting
everything running (I still don't have IPoIB or MVAPICH2... but I can
live without both).

Now, I'm running Opensm with -V, and it looks as I expected.

This cluster is simple: 9 nodes in one switch.

Thanks,

Chris
On 14 Apr 2006 11:38:21 -0400, Hal Rosenstock <halr at voltaire.com> wrote:
> Hi again Chris,
>
> On Fri, 2006-04-14 at 11:29, Chris Worley wrote:
> > Hal,
> >
> > It looks like 1 per GUID.  I don't see a capability mask.  An example is:
> >
> > Apr 14 07:28:18 879428 [40602960] -> __osm_trap_rcv_process_request:
> > Received Generic Notice type:0x04 num:144 Producer:1 f
> > rom LID:0x0007 TID:0x0000000000000001
> > Apr 14 07:28:18 879513 [40602960] -> osm_report_notice: Reporting
> > Generic Notice type:4 num:144 from LID:0x0007 GID:0xfe800
> > 00000000000,0x0002c9020020c3b6
>
> Are you running with verbose (-V) ? You only see that extra info then.
>
> Just out of curiousity, how big is your subnet and what is the topology
> ?
>
> -- Hal
>
> > Thanks,
> >
> > Chris
> > On 14 Apr 2006 10:55:00 -0400, Hal Rosenstock <halr at voltaire.com> wrote:
> > > Hi again Chris,
> > >
> > > On Fri, 2006-04-14 at 10:39, Chris Worley wrote:
> > > > Hal,
> > > >
> > > > You're correct... the results of the scans are in /var/log/osm.log.  I
> > > > was expecting the "-console" mode to show more.
> > > >
> > > > In looking at the /var/log/osm.log I'm seeing a lot of:
> > > >
> > > > Reporting Generic Notice type:4 num:144
> > > >
> > > > For different GUIDs.
> > >
> > > What's a lot ? One for each GUID ? What's the capability mask indicated
> > > ?
> > >
> > > >   Is there a place to look these up?
> > >
> > > Yes, the IBA spec (volume 1). Trap 144 indicates that the capability
> > > mask at the indicated LID has changed.
> > >
> > > > I still don't have IPoIB running, and ibv_devinfo says I'm not setup
> > > > right either (couldn't open a device).
> > >
> > > I'm not sure why not.
> > >
> > > -- Hal
> > >
> > > > Thanks,
> > > >
> > > > Chris
> > > > On 14 Apr 2006 10:22:22 -0400, Hal Rosenstock <halr at voltaire.com> wrote:
> > > > > Hi Chris,
> > > > >
> > > > > On Fri, 2006-04-14 at 10:19, Chris Worley wrote:
> > > > > > I installed the SuSE 10 OpenIB RC2 RPMS.
> > > > > >
> > > > > > The installation went well, but I'm stuck at the startup.
> > > > > >
> > > > > > As an IBGD user, I'm used to an init file in /etc/init.d... but there was none.
> > > > > >
> > > > > > >From the wiki, I was able to glean:
> > > > > >
> > > > > >             Make the udev file:
> > > > > >
> > > > > > # cat > /etc/udev/rules.d/40-infiniband.rules
> > > > > > KERNEL="umad*", NAME="infiniband/%k"
> > > > > > KERNEL="issm*", NAME="infiniband/%k"
> > > > > >
> > > > > >              Install some modules:
> > > > > >
> > > > > > modprobe ib_ucm
> > > > > > modprobe ib_cm
> > > > > > modprobe ib_uverbs
> > > > > > modprobe ib_umad
> > > > > >
> > > > > > And make sure udev is running, and start the opensm.
> > > > > >
> > > > > > I've done this on all nodes, and ibstat shows I have a link up and
> > > > > > running on every node.  Opensm doesn't show any scanning.  It's been
> > > > > > hung all night at:
> > > > > >
> > > > > > # opensm --console
> > > > > > -------------------------------------------------
> > > > > > OpenSM Rev:openib-1.2.0
> > > > > > Based on OpenIB svn Exported revision
> > > > > > Command Line Arguments:
> > > > > >  Enabling OpenSM interactive console
> > > > > >  Log File: /var/log/osm.log
> > > > > > -------------------------------------------------
> > > > > > OpenSM Rev:openib-1.2.0 OpenIB svn Exported revision
> > > > > >
> > > > > > Using default guid 0x2c9020020c3ce
> > > > > >
> > > > > > OpenSM Console
> > > > > >
> > > > > > $ Entering MASTER state
> > > > > >
> > > > > > SUBNET UP
> > > > >
> > > > > Looks like everything is fine from the OpenSM standpoint.
> > > > >
> > > > > I see no indication that OpenSM is hung. You are in the console.
> > > > >
> > > > > Also, why do you say OpenSM isn't "scanning" ?
> > > > >
> > > > > What is in /var/log/osm.log ? Any errors ?
> > > > >
> > > > > If you want more verbose messages start OpenSM with -V.
> > > > >
> > > > > -- Hal
> > > > >
> > > > > > IPoIB isn't up.  ibv_rc_pingpong doesn't work.  Neither does ibv_devinfo.
> > > > > >
> > > > > > Is there a definitive guide on the initialization of the drivers and fabric?
> > > > > >
> > > > > > Also, is there an MVAPICH2 for SuSE 10 RPM?
> > > > > >
> > > > > > Thanks,
> > > > > >
> > > > > > Chris
> > > > > > _______________________________________________
> > > > > > openib-general mailing list
> > > > > > openib-general at openib.org
> > > > > > http://openib.org/mailman/listinfo/openib-general
> > > > > >
> > > > > > To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
> > > > >
> > > > >
> > > > _______________________________________________
> > > > openib-general mailing list
> > > > openib-general at openib.org
> > > > http://openib.org/mailman/listinfo/openib-general
> > > >
> > > > To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
> > >
> > >
> > _______________________________________________
> > openib-general mailing list
> > openib-general at openib.org
> > http://openib.org/mailman/listinfo/openib-general
> >
> > To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
>
>



More information about the general mailing list