[ofa-general] ibnetdiscover

Bernd Schubert bs at q-leap.de
Wed Aug 8 02:37:30 PDT 2007


Hello Hal,

thanks for your help.

On Tuesday 07 August 2007 14:36:09 Hal Rosenstock wrote:
> On 8/7/07, Bernd Schubert <bs at q-leap.de> wrote:
> > Hi,
> >
> > I two questions about ibnetdiscover.
> >
> > 1) How reliable is it? Here in our testing lab ibnetdiscover works fine,
> > detects the proper names of the two infiniband switches and all connected
> > client cards. On a customer system it doesn't work that well. I know the
> > client is connected to an MTS2400 switch, but ibnetdiscover detects a
> > MT47396. The MTS2400 is connected to MTS1400 switch, but again
> > ibnetdiscover believs it is again a MT47396. Any idea whats going on?
>
> Where is MT47396 being displayed ? ibnetdiscover displays the
> NodeDescription and perhaps that is not set properly on those
> switches. You can verify this with smpquery nodedesc.

ibwarn: [30449] handle_port: NodeInfo on DR path 0,1,12,21,16,1 failed, skipping port
#
# Topology file: generated on Tue Aug  7 13:51:41 2007
#
# Max of 5 hops discovered
# Initiated from node 0002c90200401338 port 0002c90200401339

vendid=0x2c9
devid=0xb924
sysimgguid=0xb8cffff0024ef
switchguid=0xb8cffff0024ef
Switch  24 "S-000b8cffff0024ef"         # "MT47396 Infiniscale-III Mellanox Technologies" base port 0 lid 154 lmc 0
[24]    "S-0002c9010befe970"[2]         # "MT47396 Infiniscale-III Mellanox Technologies" lid 138
[23]    "S-0002c9010befe970"[1]         # "MT47396 Infiniscale-III Mellanox Technologies" lid 138
[22]    "H-0002c902004013c0"[1]         # "MT23108 InfiniHost Mellanox Technologies" lid 4
[...]


ha-beo-2:/tmp/ofed/leuven# smpquery -vvvv -G switchinfo 0xb8cffff0024ef
# Switch info: Lid 154
LinearFdbCap:....................49152
RandomFdbCap:....................0
McastFdbCap:.....................1024
LinearFdbTop:....................321
DefPort:.........................0
DefMcastPrimPort:................0
DefMcastNotPrimPort:.............0
LifeTime:........................18
StateChange:.....................0
LidsPerPort:.....................0
PartEnforceCap:..................32
InboundPartEnf:..................1
OutboundPartEnf:.................1
FilterRawInbound:................1
FilterRawInbound:................1
EnhancedPort0:...................0


Sorry, but I have no idea how the output of smpquery should help me.

>
> > 2) ibnetdiscover also can't detect everything on the very same customer
> > system, it shows this error
> >
> > ibwarn: [30449] handle_port: NodeInfo on DR path 0,1,12,21,16,1 failed,
> > skipping port
> >
> > Does this mean port 1 of the last switch failed?
>
> It means that the peer port of port 1's SMA failed to respond to the
> SubnGet NodeInfo. What is connected there and what state is it in ?

Unfortunately I have no idea. The system is not located near to us and its 
rather difficult to ask our customer, since ibnetdiscover shows the wrong 
switch names. I will try anyway later on this day.

>
> > I'm also not sure about the pathes, IMHO the man page of ibnetdiscover
> > should give one more example, so
> >
> >
> >
> >       -D      use directed path address arguments. The path
> >               is a comma separated list of out ports.
> >               Examples:
> >               "0"             # self port
> >               "0,1,2,1,4"     # out via port 1, then 2, ...
> >
> >
> > "out via port 1, then out via port 2, then out via port 1, ..."
> >
> > or
> >
> > "out via port 1, then in port 2, then out via port 1, ..."
> >
> >
> > You see what I mean?
>
> It's the former. It's the out port on each hop along the path.

Thanks, do you mind to apply this patch? I know the text already says 
"separated list of out ports", but for those who don't believe it
like me ;) the comment will make it a bit more convincing.


--- ./ofa_user-1.2.orig/src/userspace/management/diags/man/ibnetdiscover.8      2007-06-21 16:39:17.000000000 +0200
+++ ./ofa_user-1.2/src/userspace/management/diags/man/ibnetdiscover.8   2007-08-08 11:15:33.000000000 +0200
@@ -66,7 +66,7 @@
         is a comma separated list of out ports.
         Examples:
         "0"             # self port
-        "0,1,2,1,4"     # out via port 1, then 2, ...
+        "0,1,2,1,4"     # out via port 1, then out via port 2, ...
 .PP
 \-G      use GUID address argument. In most cases, it is the Port GUID.
         Example:


Thanks again,
Bernd


-- 
Bernd Schubert
Q-Leap Networks GmbH



More information about the general mailing list