[ofa-general] Re: [PATCH v2] infiniband-diags/scripts: Add 'ibcheckspeed' and 'ibcheckportspeed' to scripts

Hal Rosenstock hal.rosenstock at gmail.com
Mon Sep 14 11:51:03 PDT 2009


On Mon, Sep 14, 2009 at 2:02 PM, Ira Weiny <weiny2 at llnl.gov> wrote:

> On Fri, 11 Sep 2009 09:32:39 +0530
> Keshetti Mahesh <keshetti.mahesh at gmail.com> wrote:
>
> > My badness. I have not used 'iblinkinfo' before.
> > So, I guess there is no need for the above script. Apart from that, I
> feel
> > there should be a program/script which will first scan the fabric to find
> the
> > maximum common supported width/speed and then report the warning messages
> > of the links/ports which are configured with active width/speed less
> > than the found
> > value. Is there any tool already exists which does the same ?
>
> Not that I know of.
>

ibportstate does this but is on a per port basis. This could be readily
scripted (ad hoc or in tree) for this purpose.

-- Hal

<snip...>



>
> While I could see the usefulness of such a tool in some environments I have
> gone down the path of making the OFED diags more generic and then writing
> some wrappers for our local needs.  Currently I have a script which runs
> iblinkinfo with the "-l" option and then returns total number of links at
> SDR, DDR, QDR as well as the number of links at 1, 4, or 12X.  I then leave
> it up to the sys admin to know if their cluster is homo or heterogenious and
> how many links should be at what speeds.  They can then use iblinkinfo to
> identify which links are incorrect for their particular installation.
>
> Ira
>
> >
> > -
> > Keshetti Mahesh
> >
> > On Thu, Sep 10, 2009 at 9:32 PM, Ira Weiny <weiny2 at llnl.gov> wrote:
> > > Also, iblinkinfo will report links which it finds capable of either
> faster or wider operation.  iblinkinfo checks both ends of the link as Hal
> mentions.  It reports this with output like.
> > >
> > > Switch 0x0005ad0000092106 Cisco Switch SFS7000D:
> > > ...
> > >           7    8[  ] ==( 4X 2.5 Gbps Active/  LinkUp)==>       8   12[
>  ] "MT47396 Infiniscale-III Mellanox Technologies" ( Could be 5.0 Gbps)
> > > ...
> > >
> > > Also the portstatus console command in OpenSM will report links which
> are running at "reduced speed or width".  Although this does not check the
> remote port.
> > >
> > > OpenSM $ help portstatus
> > > portstatus [ca|switch|router]
> > > summarize port status
> > >   [ca|switch|router] -- limit the results to the node type specified
> > > OpenSM $ portstatus
> > > "ALL" port status:
> > >   115 port(s) scanned on 9 nodes in 26 us
> > >   85 down
> > >   30 active
> > >   32 at 4X
> > >   22 at 2.5 Gbps
> > >   8 at 5.0 Gbps
> > >   2 at 10.0 Gbps
> > >
> > > Possible issues:
> > >   2 disabled
> > >      0x0008f10400411b18 5 (ISR9024D Voltaire)
> > >      0x0005ad0000092106 13 (Cisco Switch SFS7000D)
> > >   6 with reduced speed
> > >      0x0008f10500200220 33 (Voltaire 4036 - 36 QDR ports switch)
> > >      0x0008f10500200220 19 (Voltaire 4036 - 36 QDR ports switch)
> > >      0x0005ad0000092106 21 (Cisco Switch SFS7000D)
> > >      0x0005ad0000092106 20 (Cisco Switch SFS7000D)
> > >      0x0005ad0000092106 9 (Cisco Switch SFS7000D)
> > >      0x0005ad0000092106 8 (Cisco Switch SFS7000D)
> > >
> > >
> > > Ira
> > >
> > > On Thu, 10 Sep 2009 09:23:35 -0400
> > > Hal Rosenstock <hal.rosenstock at gmail.com> wrote:
> > >
> > >> On Thu, Sep 10, 2009 at 9:02 AM, Keshetti Mahesh
> > >> <keshetti.mahesh at gmail.com>wrote:
> > >>
> > >> > Added 'ibcheckspeed' and 'ibcheckportspeed': Similar to
> > >> > 'ibcheckwidth/ibcheckportwidth' in functionality and implementation.
> > >> > Reports error/warning messages if the LinkSpeedActive is configured
> as
> > >> > 2.5 Gbps when the LinkSpeedSupported is more than 2.5 Gbps.
> > >> >
> > >>
> > >> ibportstate checks for more than this in terms of speed (and width)
> > >> anomalies.
> > >>
> > >> Would it be better for these scripts to use that tool now ?
> Alternatively,
> > >> the additional speed/width anomaly checks could be implemented in
> these
> > >> scripts but it does involve checking the peer port so there's a little
> more
> > >> to it.
> > >>
> > >> -- Hal
> > >>
> > >>
> > >> >
> > >> > Signed-off-by: Keshetti Mahesh < keshetti.mahesh at gmail.com>
> > >> > ---
> > >> >  infiniband-diags/scripts/ibcheckportspeed.in |  146
> > >> > ++++++++++++++++++++++++++
> > >> >  infiniband-diags/scripts/ibcheckportwidth.in |    2 +-
> > >> >  infiniband-diags/scripts/ibcheckspeed.in     |  135
> > >> > ++++++++++++++++++++++++
> > >> >  3 files changed, 282 insertions(+), 1 deletions(-)
> > >> >  create mode 100644 infiniband-diags/scripts/ibcheckportspeed.in
> > >> >  create mode 100644 infiniband-diags/scripts/ibcheckspeed.in
> > >> >
> > >> <snip...>
> > >>
> > >
> > >
> > > --
> > > Ira Weiny
> > > Math Programmer/Computer Scientist
> > > Lawrence Livermore National Lab
> > > 925-423-8008
> > > weiny2 at llnl.gov
> > >
>
>
> --
>  Ira Weiny
> Math Programmer/Computer Scientist
> Lawrence Livermore National Lab
> 925-423-8008
> weiny2 at llnl.gov
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20090914/2a9fb863/attachment.html>


More information about the general mailing list