[openib-general] question on opensm error

Hal Rosenstock halr at voltaire.com
Tue Feb 15 03:50:11 PST 2005


Hi Ron,

On Mon, 2005-02-14 at 15:59, Ronald G. Minnich wrote:
> formerly working opensm starts to get these:

So the OpenSM was up and running and these messages appeared in the log.
Did anything change in the subnet ?

> [1108414727:000284173][411FF970] -> umad_receiver: send completed with 
> error(method=1 attr=11) -- dropping.
> [1108414727:000384171][411FF970] -> umad_receiver: send completed with 
> error(method=1 attr=11) -- dropping.
> [1108414727:000484169][411FF970] -> umad_receiver: send completed with 
> error(method=1 attr=11) -- dropping.

These are failures of the OpenSM to send a SM Get(NodeInfo) which are
used during the periodic subnet sweeps. I think the only way this error
happens is if physical link is not present on the local link (e.g.
logical link is not in init state or beyond). 

So was a cable pulled somewhere ? 

Is this problem intermittent ? Does it come and go for no apparent
reason ? Does the subnet get out of this state or do you need to 
restart OpenSM ?

Are there any other messages in the log around this which might be
useful ? 

Thanks.

-- Hal

> 
> 
> 
> what's a reasonable thing to look for, or should I just svn update and 
> hope for the best?
> 
> thanks
> 
> ron
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
> 
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general




More information about the general mailing list