[openib-general] Kernel Oops in user-mad, mad

Hal Rosenstock halr at voltaire.com
Tue Oct 3 03:46:01 PDT 2006


On Tue, 2006-10-03 at 03:46, Jack Morgenstein wrote:
> On Sunday 01 October 2006 13:14, Michael S. Tsirkin wrote:
> > Quoting r. Jack Morgenstein <jackm at dev.mellanox.co.il>:
> > > Subject: Kernel Oops in user-mad, mad
> > > 
> > > We received the following kernel Oops while running regression
> > > (see console picture attached).
> > > 
> > > This looks like a possible race condition between handling umad send completions
> > > and ib_unregister_mad_agent.
> > > 
> > > The Oops is at the list_del line of dequeue_send (user_mad.c: 186)
> > > Note that ib_unregister_mad_agent invokes unregister_mad_agent->cancel_mads -> agent send handler.
> > > 
> > > Is there a possibility that there is a double deletion from a list somewhere?
> > > 
> > > Jack
> > > 
> > > 
> > > 
> > 
> > Was this during module unload?
> No.

What caused the ib_unregister_mad_agent routine to be invoked ? Was
OpenSM shutting down when this occurred ? Can you provide any more
details on the scenario which caused this ?

-- Hal






More information about the general mailing list