[ewg] ib_mthca failed NOP command problem

Matteo Tescione matteo at rmnet.it
Thu Apr 14 07:40:19 PDT 2011


Hi to all,

I just received a bunch of TopSpin dual port MT23108 with fw 3.5.9 pci-x, but when I try to modprobe ib_mthca module i get:

ib_mthca: Mellanox InfiniBand HCA driver v1.0-ofed1.5.3 (January 19, 2011)
ib_mthca: Initializing 0000:04:00.0
ib_mthca 0000:04:00.0: PCI INT A -> GSI 53 (level, low) -> IRQ 53
ib_mthca 0000:04:00.0: irq 74 for MSI/MSI-X
ib_mthca 0000:04:00.0: irq 75 for MSI/MSI-X
ib_mthca 0000:04:00.0: irq 76 for MSI/MSI-X
ib_mthca 0000:04:00.0: NOP command failed to generate interrupt (IRQ 76).
ib_mthca 0000:04:00.0: Trying again with MSI-X disabled.
ib_mthca 0000:04:00.0: NOP command failed to generate interrupt (IRQ 53), aborting.
ib_mthca 0000:04:00.0: BIOS or ACPI interrupt routing problem?
ib_mthca 0000:04:00.0: PCI INT A disabled
ib_mthca: probe of 0000:04:00.0 failed with error -16
ib_mthca: Mellanox InfiniBand HCA driver v1.0-ofed1.5.3 (January 19, 2011)
ib_mthca: Initializing 0000:04:00.0
ib_mthca 0000:04:00.0: PCI INT A -> GSI 53 (level, low) -> IRQ 53
ib_mthca 0000:04:00.0: Found bridge: 0000:03:02.0
ib_mthca 0000:04:00.0: FW version 000300050395, max commands 64
ib_mthca 0000:04:00.0: Catastrophic error buffer at 0xdfef8a04, size 0x10
ib_mthca 0000:04:00.0: FW supports commands through doorbells
ib_mthca 0000:04:00.0: Mapped doorbell page for posting FW commands
ib_mthca 0000:04:00.0: FW size 6143 KB (start ff7a00000, end ff7ffffff)
ib_mthca 0000:04:00.0: HCA memory size 131071 KB (start ff0000000, end ff7ffffff)
ib_mthca 0000:04:00.0: Max QPs: 16777216, reserved QPs: 1024, entry size: 256
ib_mthca 0000:04:00.0: Max SRQs: 1024, reserved SRQs: 16, entry size: 32
ib_mthca 0000:04:00.0: Max CQs: 16777216, reserved CQs: 128, entry size: 64
ib_mthca 0000:04:00.0: Max EQs: 64, reserved EQs: 1, entry size: 64
ib_mthca 0000:04:00.0: reserved MPTs: 16, reserved MTTs: 16
ib_mthca 0000:04:00.0: Max PDs: 16777216, reserved PDs: 0, reserved UARs: 1
ib_mthca 0000:04:00.0: Max QP/MCG: 16777216, reserved MGMs: 0
ib_mthca 0000:04:00.0: Max CQEs: 131072, max WQEs: 65535, max SRQ WQEs: 65535
ib_mthca 0000:04:00.0: Flags: 00370347
ib_mthca 0000:04:00.0: profile[ 0]--10/20 @ 0x       ff0000000 (size 0x 4000000)
ib_mthca 0000:04:00.0: profile[ 1]-- 0/16 @ 0x       ff4000000 (size 0x 1000000)
ib_mthca 0000:04:00.0: profile[ 2]-- 7/18 @ 0x       ff5000000 (size 0x  800000)
ib_mthca 0000:04:00.0: profile[ 3]-- 9/17 @ 0x       ff5800000 (size 0x  800000)
ib_mthca 0000:04:00.0: profile[ 4]-- 3/16 @ 0x       ff6000000 (size 0x  400000)
ib_mthca 0000:04:00.0: profile[ 5]-- 4/16 @ 0x       ff6400000 (size 0x  200000)
ib_mthca 0000:04:00.0: profile[ 6]-- 8/13 @ 0x       ff6600000 (size 0x  200000)
ib_mthca 0000:04:00.0: profile[ 7]--12/15 @ 0x       ff6800000 (size 0x  100000)
ib_mthca 0000:04:00.0: profile[ 8]--11/11 @ 0x       ff6900000 (size 0x   10000)
ib_mthca 0000:04:00.0: profile[ 9]-- 2/10 @ 0x       ff6910000 (size 0x    8000)
ib_mthca 0000:04:00.0: profile[10]-- 6/ 5 @ 0x       ff6918000 (size 0x     800)
ib_mthca 0000:04:00.0: HCA memory: allocated 107618 KB/124928 KB (17310 KB free)
ib_mthca 0000:04:00.0: irq 74 for MSI/MSI-X
ib_mthca 0000:04:00.0: irq 75 for MSI/MSI-X
ib_mthca 0000:04:00.0: irq 76 for MSI/MSI-X
ib_mthca 0000:04:00.0: Allocated EQ 1 with 131072 entries
ib_mthca 0000:04:00.0: Allocated EQ 2 with 256 entries
ib_mthca 0000:04:00.0: Allocated EQ 3 with 256 entries
ib_mthca 0000:04:00.0: Setting mask 00000000001f43fe for eqn 2
ib_mthca 0000:04:00.0: Setting mask 0000000000000400 for eqn 3
ib_mthca 0000:04:00.0: NOP command failed to generate interrupt (IRQ 76).
ib_mthca 0000:04:00.0: Trying again with MSI-X disabled.
ib_mthca 0000:04:00.0: Clearing mask 00000000001f43fe for eqn 2
ib_mthca 0000:04:00.0: Clearing mask 0000000000000400 for eqn 3
ib_mthca 0000:04:00.0: Allocated EQ 1 with 131072 entries
ib_mthca 0000:04:00.0: Allocated EQ 2 with 256 entries
ib_mthca 0000:04:00.0: Allocated EQ 3 with 256 entries
ib_mthca 0000:04:00.0: Setting mask 00000000001f43fe for eqn 2
ib_mthca 0000:04:00.0: Setting mask 0000000000000400 for eqn 3
ib_mthca 0000:04:00.0: NOP command failed to generate interrupt (IRQ 53), aborting.
ib_mthca 0000:04:00.0: BIOS or ACPI interrupt routing problem?
ib_mthca 0000:04:00.0: Clearing mask 00000000001f43fe for eqn 2
ib_mthca 0000:04:00.0: Clearing mask 0000000000000400 for eqn 3
ib_mthca 0000:04:00.0: PCI INT A disabled
ib_mthca: probe of 0000:04:00.0 failed with error -16




Reading previous post I tryied even with fw_cmd_doorbell 0 and 1 but all gives me the same result. I tried another card, another kernel, always the same.
Current kernel is 2.6.29, module is build from OFED-1.5.3.1 (latest). Hardware is Intel MB with dual Xeon E5520, 4gb RAM. the card is seen by lspci as:


03:02.0 PCI bridge: Mellanox Technologies MT23108 PCI Bridge (rev a1) (prog-if 00 [Normal decode])
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
        Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Latency: 64, Cache Line Size: 32 bytes
        Bus: primary=03, secondary=04, subordinate=04, sec-latency=64
        I/O behind bridge: 0000f000-00000fff
        Memory behind bridge: dfe00000-dfefffff
        Prefetchable memory behind bridge: 0000000ff0000000-0000000ffff00000
        Secondary status: 66MHz+ FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- <SERR- <PERR-
        BridgeCtl: Parity+ SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
        Capabilities: [70] PCI-X bridge device
                Secondary Status: 64bit+ 133MHz+ SCD- USC- SCO- SRD- Freq=133MHz
                Status: Dev=03:02.0 64bit+ 133MHz+ SCD- USC- SCO- SRD-
                Upstream: Capacity=512 CommitmentLimit=512
                Downstream: Capacity=128 CommitmentLimit=128

04:00.0 InfiniBand: Mellanox Technologies MT23108 InfiniHost (rev a1)
        Subsystem: Mellanox Technologies MT23108 InfiniHost
        Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
        Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Interrupt: pin A routed to IRQ 53
        Region 0: Memory at dfe00000 (64-bit, non-prefetchable) [size=1M]
        Region 2: Memory at fff800000 (64-bit, prefetchable) [size=8M]
        Region 4: Memory at ff0000000 (64-bit, prefetchable) [size=128M]
        Capabilities: [40] MSI-X: Enable- Mask- TabSize=32
                Vector table: BAR=0 offset=00082000
                PBA: BAR=0 offset=00082200
        Capabilities: [50] Vital Product Data
        Capabilities: [60] Message Signalled Interrupts: 64bit+ Queue=0/5 Enable-
                Address: 0000000000000000  Data: 0000
        Capabilities: [70] PCI-X non-bridge device
                Command: DPERE- ERO- RBC=4096 OST=2
                Status: Dev=04:00.0 64bit+ 133MHz+ SCD- USC- DC=simple DMMRBC=4096 DMOST=2 DMCRS=8 RSCEM- 266MHz- 533MHz-

Any hint?

Thanks in advance,
--
#Matteo Tescione
#RMnet
 



More information about the ewg mailing list