[ofa-general] Both opensm's are in SMINFO_STANDBY and none of them claims master

Venkatesh Babu venkatesh.babu at 3leafnetworks.com
Mon May 21 23:31:24 PDT 2007



Hal Rosenstock wrote:

>
>Can you at least use OFED 1.2 management (OpenSM and management
>libraries) with the rest being OFED 1.1 ?
>  
>
 Are these backward compatible ?

>There are a number of bugs which have been fixed which might affect
>this. The one I can think of off the top of my head is a fix to atomics
>in OpenSM's complib. I think that was found and fixed post OFED 1.1.
>I'll confirm this tomorrow.
>
>There may also be some important kernel differences (in user_mad.c or
>mad.c) which might be relevant.
>  
>
  It would be great if you can find these particular patches, we could 
apply these onto OFED 1.1
instead of migrating to OFED 1.2.

  By the way, when is production quality OFED 1.2 is supposed to be 
released ?

>I was referring to using perfquery, not ibnetdiscover.
>  
>
 I don't have that output right now. But I found that all other error 
counters were zero except port_xmit_discards.

>  
>
>>ibwarn: [5895] handle_port: NodeInfo on DR path [0][1][9] port 9 failed,
>>skipping port
>>    
>>
>
>Was this node rebooting while you did this or is there some other issue
>?
>  
>
  Yes, it is quite possible that node was being rebooted.

>
>So run these (before and after):
>perfquery 12 18
>perfquery 12 11
>perfquery 12 10
>perfquery 12 19
>
>and
>
>perfquery 12 9
>  
>
  Unfortunately the systems got rebooted and issue is lost. I was able 
to collect the perfquery output. It looks like now it is seeing some errors.
[root at vortex3l-83 ~]# perfquery 12 9
# Port counters: Lid 12 port 9
PortSelect:......................9
CounterSelect:...................0x0100
SymbolErrors:....................65535
LinkRecovers:....................2
LinkDowned:......................255
RcvErrors:.......................1
RcvRemotePhysErrors:.............0
RcvSwRelayErrors:................41484
XmtDiscards:.....................4918
XmtConstraintErrors:.............0
RcvConstraintErrors:.............0
LinkIntegrityErrors:.............0
ExcBufOverrunErrors:.............0
VL15Dropped:.....................1
XmtBytes:........................2050081143
RcvBytes:........................4294967295
XmtPkts:.........................14539343
RcvPkts:.........................37028545
[root at vortex3l-83 ~]# perfquery 12 10
# Port counters: Lid 12 port 10
PortSelect:......................10
CounterSelect:...................0x0100
SymbolErrors:....................65535
LinkRecovers:....................27
LinkDowned:......................255
RcvErrors:.......................0
RcvRemotePhysErrors:.............0
RcvSwRelayErrors:................19936
XmtDiscards:.....................5192
XmtConstraintErrors:.............0
RcvConstraintErrors:.............0
LinkIntegrityErrors:.............0
ExcBufOverrunErrors:.............0
VL15Dropped:.....................0
XmtBytes:........................4294967295
RcvBytes:........................4294967295
XmtPkts:.........................1739931538
RcvPkts:.........................1794380558
[root at vortex3l-83 ~]# perfquery 12 11
# Port counters: Lid 12 port 11
PortSelect:......................11
CounterSelect:...................0x0100
SymbolErrors:....................65535
LinkRecovers:....................0
LinkDowned:......................255
RcvErrors:.......................1
RcvRemotePhysErrors:.............0
RcvSwRelayErrors:................8963
XmtDiscards:.....................5636
XmtConstraintErrors:.............0
RcvConstraintErrors:.............0
LinkIntegrityErrors:.............0
ExcBufOverrunErrors:.............0
VL15Dropped:.....................0
XmtBytes:........................4294967295
RcvBytes:........................4294967295
XmtPkts:.........................2375935494
RcvPkts:.........................2714377528
[root at vortex3l-83 ~]# perfquery 12 18
# Port counters: Lid 12 port 18
PortSelect:......................18
CounterSelect:...................0x0100
SymbolErrors:....................65535
LinkRecovers:....................24
LinkDowned:......................220
RcvErrors:.......................0
RcvRemotePhysErrors:.............0
RcvSwRelayErrors:................65535
XmtDiscards:.....................23628
XmtConstraintErrors:.............0
RcvConstraintErrors:.............0
LinkIntegrityErrors:.............0
ExcBufOverrunErrors:.............0
VL15Dropped:.....................0
XmtBytes:........................4294967295
RcvBytes:........................4294967295
XmtPkts:.........................604709394
RcvPkts:.........................448409077
[root at vortex3l-83 ~]# perfquery 12 19
# Port counters: Lid 12 port 19
PortSelect:......................19
CounterSelect:...................0x0100
SymbolErrors:....................65535
LinkRecovers:....................21
LinkDowned:......................247
RcvErrors:.......................0
RcvRemotePhysErrors:.............0
RcvSwRelayErrors:................65535
XmtDiscards:.....................37754
XmtConstraintErrors:.............0
RcvConstraintErrors:.............0
LinkIntegrityErrors:.............0
ExcBufOverrunErrors:.............0
VL15Dropped:.....................0
XmtBytes:........................4294967295
RcvBytes:........................4294967295
XmtPkts:.........................3958092428
RcvPkts:.........................3679343076
[root at vortex3l-83 ~]#

  -VBabu



More information about the general mailing list