[openib-general] OpenSM log growing too big

Hal Rosenstock halr at voltaire.com
Thu Nov 16 13:49:04 PST 2006


Hi Venkat,
 
See embedded <hnr>...</hnr> comments below.
 
-- Hal

________________________________

From: Venkatesh Babu [mailto:venkatesh.babu at 3leafnetworks.com]
Sent: Thu 11/16/2006 1:39 PM
To: Hal Rosenstock
Cc: openib-general at openib.org
Subject: Re: [openib-general] OpenSM log growing too big



Hal Rosenstock wrote:

> Not sure what question you are asking exactly.
>
> Is it what do those messages mean or the file getting large or both ?
> 
>
   Both. The message looks like LID 5 is generating too many events.

<hnr> Yes, LID 5 is  a switch LID and there is a port which is flapping. Bad cable ? </hnr>

The log file grows few MBs a second. What ever the problem with the port it
should not generate these many log messages. I guess it is a OpenSM bug.

<hnr> The code is reducing the messages which are similar (approx 128 traps).
The SM is repressing the trap and then the switch regenerates it becuase there is a port going up and down.
That issue should be resolved. There has been discussion on the list and patches on dealing with the log and limiting its size that are in more recent versions of OpenSM. I'll look at it to see if I can reduce these messages further. </hnr>


> What options are you using on OpenSM startup ?
> 
>
  root      7703  0.0  0.0 92784 1652 ?        Sl   05:00   0:01
/usr/bin/opensm -g 0x005045014ac20001 -p 11 -s 10 -u -f /var/log/opensm.log

> Also, any chance you can move forward on a more recent and better
> OpenSM ?
> 
>
 It is difficult to use OpenSM from OFED 1.1. Because we need to do
another QA verification cycle with our product.
But I can find the specific patch to the OpenSM I can apply that patch
to the existing OpenSM.

<hnr> I would highly recommend moving to OFED 1.1 OpenSM (from OFED 1.0). Many bugs have been fixed and it is much more robust. </hnr>

 VBabu

>
> -- Hal
> 







More information about the general mailing list