***SPAM*** Re: [ofa-general] OpenSM initialization error

Hal Rosenstock hal.rosenstock at gmail.com
Fri Mar 27 14:55:39 PDT 2009


On Fri, Mar 27, 2009 at 4:51 PM, Yicheng Jia <YJia at tmriusa.com> wrote:
>
> The "port_rcv_errors" attribute on OpenSM side of SM link is 65535, other
> errors attributes are all 0.

That's max'd out so should be cleared to see if it is continually incrementing.

Port Rcv Errors are the following:
• Local physical errors (ICRC, VCRC, LPCRC, and all physical
errors that cause entry into the BAD PACKET or BAD PACKET
DISCARD states of the packet receiver state machine)
• Malformed data packet errors (LVer, length, VL)
• Malformed link packet errors (operand, length, VL)
• Packets discarded due to buffer overrun

> How can I check SM link status on the switch side?

Get the portinfo on the SM peer port. Something like:

smpquery -D portinfo 0,1

assuming you are doing this from the SM node.

> Is there anything I can do if the SM link port on the switch side is
> not in init state?

Let's see if that's the case first.

> I don't think it's cable issue. I have 3 unmanaged switches on different
> system, all have the same problem.

So different cables are in use ? Same HCA for OpenSM though ?

-- Hal

> Thanks!
>
> Yicheng Jia
>
>
>
>
> Hal Rosenstock <hal.rosenstock at gmail.com>
>
> 03/27/2009 03:19 PM
>
> To
> Yicheng Jia <YJia at tmriusa.com>
> cc
> general at lists.openfabrics.org, hnrose at comcast.net
> Subject
> Re: [ofa-general] OpenSM initialization error
>
>
>
>
> On Fri, Mar 27, 2009 at 4:18 PM, Hal Rosenstock
> <hal.rosenstock at gmail.com> wrote:
>> On Fri, Mar 27, 2009 at 4:13 PM, Yicheng Jia <YJia at tmriusa.com> wrote:
>>>
>>> Yes. This persists for nearly 1 minute.
>>>
>>> One thing I notice is if I start mthca driver more than 1 minute before
>>> OpenSM starts, then there's no such error message anymore, this is gone.
>>> So
>>> I guess there's some handshaking going on between mthca driver and
>>> unmanaged
>>> switch firmware, which stablize communication before OpenSM starts. Do
>>> you
>>> have any idea about it?
>>
>> The physical link is brought up all on its own assuming the ports are
>> not disabled. Maybe the SM link being used is not in init yet. I don't
>> think that is well handled.
>
> What do the port counters say on both sides of the SM link ?
>
> Maybe try different cables on the SM link and see if this gets better.
>
> -- Hal
>
>> -- Hal
>>
>>> Thanks!
>>>
>>> Yicheng Jia
>>>
>>>
>>>
>>>
>>> Hal Rosenstock <hal.rosenstock at gmail.com>
>>>
>>> 03/27/2009 03:02 PM
>>>
>>> To
>>> Yicheng Jia <YJia at tmriusa.com>
>>> cc
>>> general at lists.openfabrics.org, hnrose at comcast.net
>>> Subject
>>> Re: [ofa-general] OpenSM initialization error
>>>
>>>
>>>
>>>
>>> On Fri, Mar 27, 2009 at 3:55 PM, Yicheng Jia <YJia at tmriusa.com> wrote:
>>>>
>>>> Here is the output of saquery. I am using Qlogic 9024 FC unmanaged
>>>> switch.
>>>> Only one SM exists in the subnet.
>>>
>>> <snip...>
>>>
>>> Is this topology simply one switch surrounded by HCAs ? If so, I don't
>>> have a theory as to why this message persists for 1 second.
>>>
>>> -- Hal
>>>
>>>
>>> _____________________________________________________________________________
>>> Scanned by IBM Email Security Management Services powered by MessageLabs.
>>> For more information please visit
>>> http://www.ers.ibm.com
>>>
>>> _____________________________________________________________________________
>>>
>>>
>>>
>>> _____________________________________________________________________________
>>> Scanned by IBM Email Security Management Services powered by MessageLabs.
>>> For more information please visit http://www.ers.ibm.com
>>>
>>> _____________________________________________________________________________
>>>
>>
>
> _____________________________________________________________________________
> Scanned by IBM Email Security Management Services powered by MessageLabs.
> For more information please visit http://www.ers.ibm.com
> _____________________________________________________________________________
>
>
> _____________________________________________________________________________
> Scanned by IBM Email Security Management Services powered by MessageLabs.
> For more information please visit http://www.ers.ibm.com
> _____________________________________________________________________________
>



More information about the general mailing list