[openib-general] Port error rate detection
Steven Carter
scarter at ornl.gov
Tue Feb 20 06:44:59 PST 2007
Hal Rosenstock wrote:
> On Mon, 2007-02-19 at 15:53, Steven Carter wrote:
>
>> I have a Nagios module that alerts on connectivity, port errors,
>> speed/width problems. I would like to give it the ability to change the
>> severity of the alert depending on whether errors are just present or if
>> they are increasing faster than a specified rate. The intent is to
>> equip the module to keep the state of the last query and possibly
>> history, but I wanted to make sure that I was not re-inventing the wheel
>> first. Is there an attribute or utility that I am overlooking that will
>> help me do this?
>>
>
> Not currently (to my knowledge). The thresholding of rate aspect is
> similat to what will be supported in the proposed PerfManager.
>
I noticed that in your RFC. How are you planning on presenting the data
to other agents (e.g. Nagios, Openview, MRTG, etc.)? One comment that I
should have made on your RFC is that I wonder if it is necessary to
include the data analysis/reduction part. Just having a central
location that collects the values and presents it via SNMP is extremely
useful since there are a plethora of monitoring apps (free and
commercial) that do what you are proposing. That way, a network
manager can leverage existing tools currently used for monitoring
Ethernet Nodes, Hosts, etc. You can still include a last change
attribute with each counter so that simple utilities (like the one that
I am writing) can get an idea of how quickly errors are occurring.
Steven.
> -- Hal
>
>
>> Thanks,
>>
>> Steven.
>>
>> _______________________________________________
>> openib-general mailing list
>> openib-general at openib.org
>> http://openib.org/mailman/listinfo/openib-general
>>
>> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
>>
>>
>
>
More information about the general
mailing list