[ofa-general] Nodes dropping out of IPoIB mcast group due to a temporary node soft lockup.

Sasha Khapyorsky sashak at voltaire.com
Tue Apr 29 01:01:41 PDT 2008


On 11:03 Mon 28 Apr     , Ira Weiny wrote:
> 
> Yes I agree.  Per my previous mail to Or I found that light sweeps did not in
> fact notice the nodes were gone.  Looking at the logs I am not sure what
> caused OpenSM to notice them.  However, something must have triggered a heavy
> sweep when those nodes were catatonic.  From the logs they were unresponsive
> for multiple seconds, some as long as 30s.  It is still a bit of a mystery why
> OpenSM did a heavy sweep during this period but I don't think it is
> unreasonable for it to do so.

Could you send me log file?

Sasha



More information about the general mailing list