[openib-general] question on opensm error
Hal Rosenstock
halr at voltaire.com
Tue Feb 15 03:50:11 PST 2005
Hi Ron,
On Mon, 2005-02-14 at 15:59, Ronald G. Minnich wrote:
> formerly working opensm starts to get these:
So the OpenSM was up and running and these messages appeared in the log.
Did anything change in the subnet ?
> [1108414727:000284173][411FF970] -> umad_receiver: send completed with
> error(method=1 attr=11) -- dropping.
> [1108414727:000384171][411FF970] -> umad_receiver: send completed with
> error(method=1 attr=11) -- dropping.
> [1108414727:000484169][411FF970] -> umad_receiver: send completed with
> error(method=1 attr=11) -- dropping.
These are failures of the OpenSM to send a SM Get(NodeInfo) which are
used during the periodic subnet sweeps. I think the only way this error
happens is if physical link is not present on the local link (e.g.
logical link is not in init state or beyond).
So was a cable pulled somewhere ?
Is this problem intermittent ? Does it come and go for no apparent
reason ? Does the subnet get out of this state or do you need to
restart OpenSM ?
Are there any other messages in the log around this which might be
useful ?
Thanks.
-- Hal
>
>
>
> what's a reasonable thing to look for, or should I just svn update and
> hope for the best?
>
> thanks
>
> ron
> _______________________________________________
> openib-general mailing list
> openib-general at openib.org
> http://openib.org/mailman/listinfo/openib-general
>
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
More information about the general
mailing list