[ofa-general] Re: [PATCH v2] infiniband-diags/scripts: Add 'ibcheckspeed' and 'ibcheckportspeed' to scripts

Barry Mavin Barry.Mavin at recital.com
Thu Sep 10 22:38:15 PDT 2009


I use mellanox connectx cards and switches in a cluster.

When I try and use ibtracert I get this output.

# ibtracert 10.10.10.1 10.10.10.3
ibwarn: [6998] _do_madrpc: recv failed: Connection timed out
ibwarn: [6998] mad_rpc: _do_madrpc failed; dport (Lid 10)
ibwarn: [6998] find_route: can't reach to/from ports
ibtracert: iberror: failed: can't find a route to the src port

Does anyone have any idea why this would be happening?

---
Regards
Barry Mavin
Recital Corporation



> From: Keshetti Mahesh <keshetti.mahesh at gmail.com>
> Date: Fri, 11 Sep 2009 09:32:39 +0530
> To: Ira Weiny <weiny2 at llnl.gov>
> Cc: OFED mailing list <linux-rdma at vger.kernel.org>, OFED mailing list
> <general at lists.openfabrics.org>
> Subject: Re: [ofa-general] Re: [PATCH v2] infiniband-diags/scripts: Add
> 'ibcheckspeed' and 'ibcheckportspeed' to scripts
> 
> My badness. I have not used 'iblinkinfo' before.
> So, I guess there is no need for the above script. Apart from that, I feel
> there should be a program/script which will first scan the fabric to find the
> maximum common supported width/speed and then report the warning messages
> of the links/ports which are configured with active width/speed less
> than the found
> value. Is there any tool already exists which does the same ?
> 
> -
> Keshetti Mahesh
> 
> On Thu, Sep 10, 2009 at 9:32 PM, Ira Weiny <weiny2 at llnl.gov> wrote:
>> Also, iblinkinfo will report links which it finds capable of either faster or
>> wider operation.  iblinkinfo checks both ends of the link as Hal mentions.
>>  It reports this with output like.
>> 
>> Switch 0x0005ad0000092106 Cisco Switch SFS7000D:
>> ...
>>           7    8[  ] ==( 4X 2.5 Gbps Active/  LinkUp)==>       8   12[  ]
>> "MT47396 Infiniscale-III Mellanox Technologies" ( Could be 5.0 Gbps)
>> ...
>> 
>> Also the portstatus console command in OpenSM will report links which are
>> running at "reduced speed or width".  Although this does not check the remote
>> port.
>> 
>> OpenSM $ help portstatus
>> portstatus [ca|switch|router]
>> summarize port status
>>   [ca|switch|router] -- limit the results to the node type specified
>> OpenSM $ portstatus
>> "ALL" port status:
>>   115 port(s) scanned on 9 nodes in 26 us
>>   85 down
>>   30 active
>>   32 at 4X
>>   22 at 2.5 Gbps
>>   8 at 5.0 Gbps
>>   2 at 10.0 Gbps
>> 
>> Possible issues:
>>   2 disabled
>>      0x0008f10400411b18 5 (ISR9024D Voltaire)
>>      0x0005ad0000092106 13 (Cisco Switch SFS7000D)
>>   6 with reduced speed
>>      0x0008f10500200220 33 (Voltaire 4036 - 36 QDR ports switch)
>>      0x0008f10500200220 19 (Voltaire 4036 - 36 QDR ports switch)
>>      0x0005ad0000092106 21 (Cisco Switch SFS7000D)
>>      0x0005ad0000092106 20 (Cisco Switch SFS7000D)
>>      0x0005ad0000092106 9 (Cisco Switch SFS7000D)
>>      0x0005ad0000092106 8 (Cisco Switch SFS7000D)
>> 
>> 
>> Ira
>> 
>> On Thu, 10 Sep 2009 09:23:35 -0400
>> Hal Rosenstock <hal.rosenstock at gmail.com> wrote:
>> 
>>> On Thu, Sep 10, 2009 at 9:02 AM, Keshetti Mahesh
>>> <keshetti.mahesh at gmail.com>wrote:
>>> 
>>>> Added 'ibcheckspeed' and 'ibcheckportspeed': Similar to
>>>> 'ibcheckwidth/ibcheckportwidth' in functionality and implementation.
>>>> Reports error/warning messages if the LinkSpeedActive is configured as
>>>> 2.5 Gbps when the LinkSpeedSupported is more than 2.5 Gbps.
>>>> 
>>> 
>>> ibportstate checks for more than this in terms of speed (and width)
>>> anomalies.
>>> 
>>> Would it be better for these scripts to use that tool now ? Alternatively,
>>> the additional speed/width anomaly checks could be implemented in these
>>> scripts but it does involve checking the peer port so there's a little more
>>> to it.
>>> 
>>> -- Hal
>>> 
>>> 
>>>> 
>>>> Signed-off-by: Keshetti Mahesh < keshetti.mahesh at gmail.com>
>>>> ---
>>>>  infiniband-diags/scripts/ibcheckportspeed.in |  146
>>>> ++++++++++++++++++++++++++
>>>>  infiniband-diags/scripts/ibcheckportwidth.in |    2 +-
>>>>  infiniband-diags/scripts/ibcheckspeed.in     |  135
>>>> ++++++++++++++++++++++++
>>>>  3 files changed, 282 insertions(+), 1 deletions(-)
>>>>  create mode 100644 infiniband-diags/scripts/ibcheckportspeed.in
>>>>  create mode 100644 infiniband-diags/scripts/ibcheckspeed.in
>>>> 
>>> <snip...>
>>> 
>> 
>> 
>> --
>> Ira Weiny
>> Math Programmer/Computer Scientist
>> Lawrence Livermore National Lab
>> 925-423-8008
>> weiny2 at llnl.gov
>> 
> _______________________________________________
> general mailing list
> general at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general




More information about the general mailing list