[openib-general] OpenSM crash

Hal Rosenstock halr at voltaire.com
Fri May 27 14:37:10 PDT 2005


On Fri, 2005-05-27 at 17:33, Roland Dreier wrote:
>     > May 27 01:44:09 [43005960] -> osm_vl15_post: 4294967295 MADs on wire, 2 MADs outstanding.
> 
>     Hal> I take that back. That's just a lot of MADs have been sent
>     Hal> (on the IB wire). OpenSM was probably up and running for a
>     Hal> while...
> 
> I find it hard to believe that OpenSM has sent 4 billion MADs --
> that's more than 1000 MADs a second for a solid month.  It also looks
> very suspicious that the value is equal to ((unsigned int) -1).
                                              ^^^^^^^^^^^^^^^^^^
on a 32 bit machine.

Good point. The fact that it gets to -1 is significant as I think that
is used as a magic value for some computations.

-- Hal




More information about the general mailing list