[openib-general] Re: Disabling IRQ #201 message
Michael S. Tsirkin
mst at mellanox.co.il
Wed Feb 2 09:08:21 PST 2005
> > -----Original Message-----
> > From: Roland Dreier [mailto:roland at topspin.com]
> > Sent: Tuesday, February 01, 2005 6:31 PM
> > To: Robert Pearson
> > Cc: openib-general at openib.org
> > Subject: Re: [openib-general] Disabling IRQ #201 message
> >
> > Robert> Am running current version of openib on a 2.6.11-rc1
> > Robert> kernel on a NewIsis dual Opteron system. Every 15-20
> > Robert> minutes the following occurs. Have others seen this
> > Robert> behavior? Is the system misconfigured?
> >
> > Do the drivers work other than this message?
> >
> > It seems occasionally an interrupt occurs but the driver is not
> > finding any events in any of the event queues. I've never seen this
> > but on the other hand I've not done much testing on the
> > Opteron/AMD-8131 platform.
> >
> > - R.
>
Quoting r. Robert Pearson <rpearson at systemfabricworks.com>:
> Subject: RE: Disabling IRQ #201 message
>
> I'm debugging some code that is reading files in /sys/class/infiniband/.
> Other than that the HCA isn't doing anything at all. The dropped
> interrupt occurs whether or not I am doing anything. I can reboot the
> machine and just let it sit there and the message will occur after a
> while. After the message, files which require interacting with the HCA,
> e.g. /sys/class/infiniband/mthca0/ports/1/state, become unreadable. Read
> calls block for a long time and finally time out with an EOF indication.
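For probing attributes like the one quoted above without hanging the debugging tool itself, a minimal sketch follows. It assumes a plain file read in a worker thread is an acceptable probe. The helper name read_with_timeout and the two-second default are hypothetical and not part of openib. If the read really is stuck inside the driver, the worker thread stays blocked in the background; the caller just stops waiting for it.

```python
import threading


def read_with_timeout(path, timeout_s=2.0):
    """Return the stripped contents of `path`, or None if the read
    has not finished within `timeout_s` seconds."""
    result = {}

    def worker():
        # Runs in a separate thread so a blocked read cannot stall the caller.
        try:
            with open(path, "r") as f:
                result["value"] = f.read().strip()
        except OSError as exc:
            result["error"] = exc

    t = threading.Thread(target=worker, daemon=True)
    t.start()
    t.join(timeout_s)
    if t.is_alive():
        # The read is still blocked; report that instead of waiting further.
        return None
    if "error" in result:
        raise result["error"]
    return result.get("value")


def probe_port_state(path="/sys/class/infiniband/mthca0/ports/1/state"):
    """Report whether the port state attribute is readable right now."""
    state = read_with_timeout(path)
    return "read blocked" if state is None else state
```

Calling probe_port_state() before and after the "Disabling IRQ" message appears would show whether the attribute stops being readable at that point.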
Do you have IP over IB loaded, and/or is an SM running on the subnet?
If so, things are happening every now and then: ARP, SM sweeps, and so on.
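One way to answer the first half of that question is to look for the IPoIB module in /proc/modules. The sketch below assumes the module is named ib_ipoib and that the first whitespace-separated field of each /proc/modules line is the module name. The function names are hypothetical. It says nothing about whether an SM is running elsewhere on the subnet.

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names listed in /proc/modules-style text."""
    names = set()
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields:
            # The first field of each line is the module name.
            names.add(fields[0])
    return names


def ipoib_loaded(proc_modules_text, module_name="ib_ipoib"):
    """True if the (assumed) IPoIB module name appears in the listing."""
    return module_name in loaded_modules(proc_modules_text)


def check_running_kernel(path="/proc/modules"):
    """Read the live module list; returns None if the file is unavailable."""
    try:
        with open(path, "r") as f:
            return ipoib_loaded(f.read())
    except OSError:
        return None
```

If check_running_kernel() returns True, ARP traffic over the IPoIB interface is one possible source of HCA events on an otherwise quiet machine.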
--
MST - Michael S. Tsirkin