[openib-general] Re: Disabling IRQ #201 message

Michael S. Tsirkin mst at mellanox.co.il
Wed Feb 2 09:08:21 PST 2005


> > -----Original Message-----
> > From: Roland Dreier [mailto:roland at topspin.com]
> > Sent: Tuesday, February 01, 2005 6:31 PM
> > To: Robert Pearson
> > Cc: openib-general at openib.org
> > Subject: Re: [openib-general] Disabling IRQ #201 message
> > 
> >     Robert> Am running current version of openib on a 2.6.11-rc1
> >     Robert> kernel on a NewIsis dual Opteron system. Every 15-20
> >     Robert> minutes the following occurs. Have others seen this
> >     Robert> behavior? Is the system misconfigured?
> > 
> > Do the drivers work other than this messsage?
> > 
> > It seems occasionally an interrupt occurs but the driver is not
> > finding an events in any of the event queues.  I've never seen this
> > but on the other hand I've not done much testing on the
> > Opteron/AMD-8131 platform.
> > 
> >  - R.
> 

Quoting r. Robert Pearson <rpearson at systemfabricworks.com>:
> Subject: RE: Disabling IRQ #201 message
> 
> I'm debugging some code that is reading files in /sys/class/infiniband/.
> Other than that the HCA isn't doing anything at all. The dropped
> interrupt occurs whether or not I am doing anything. I can reboot the
> machine and just let it sit there and the message will occur after a
> while. After the message, files which require interacting with the HCA
> e.g. /sys/class/infiniband/mthca0/ports/1/state become unreadable. Read
> calls block for a long time and finally timeout with an EOF indication.

Do you have ip over ib loaded, and/or is sm running on the subnet?
If yes, things are happening every now and then: arp, sm sweep ... .

-- 
MST - Michael S. Tsirkin



More information about the general mailing list