[openib-general] ib_mthca panic on PPC64

Troy Benjegerdes hozer at hozed.org
Thu Oct 27 09:23:36 PDT 2005


I got this the other day (before I had a chance to add the debug code)

p5l0:~# [443954.161068] mthca0: ib_query_pkey port 0 failed (ret = -22)
[443988.334644] mthca0: ib_query_pkey port 0 failed (ret = -22)
[444037.579342] ib_mthca: Mellanox InfiniBand HCA driver v0.06 (June 23, 2005)
[444037.579360] ib_mthca: Initializing 0000:d9:00.0
[444101.503664] ib_mthca: Mellanox InfiniBand HCA driver v0.06 (June 23, 2005)
[444101.503682] ib_mthca: Initializing 0000:d9:00.0
[444107.815375] Oops: Kernel access of bad area, sig: 7 [#1]
[444107.815389] SMP NR_CPUS=8 NUMA PSERIES LPAR
[444107.815401] Modules linked in: ib_ipoib ib_sa ib_mthca ib_mad ib_core openaf
s
[444107.815425] NIP: D0000000098BF638 XER: 20000018 LR: C000000000057B2C CTR: D0
000000098BF5D0
[444107.815440] REGS: c0000001ee79b490 TRAP: 0300   Tainted: P       (2.6.13.3-p
ower5)
[444107.815455] MSR: 8000000000009032 EE: 1 PR: 0 FP: 0 ME: 1 IR/DR: 11 CR: 2800
0084
[444107.815469] DAR: d000010082189a04 DSISR: 0000000040000000
[444107.815481] TASK: c0000001ee7950e0[0] 'swapper' THREAD: c0000001ee798000 CPU
: 6
[444107.815494] GPR00: 0000000000000010 C0000001EE79B710 D0000000098D6540 D00001
0082189A04
[444107.815515] GPR04: 0000000000000008 00000001009D0180 0000000000000000 000000
0000000800
[444107.815535] GPR08: C0000003DDA91910 0000000000000000 C0000001EE79B840 D000010082189A04
[444107.815556] GPR12: 0000000048000082 C0000000004BF400 0000000000000000 0000000000C00060
[444107.815576] GPR16: 0000000000000006 0000000000000000 0000000000000000 0000000000000000
[444107.815595] GPR20: 0000000000000000 C0000000005F7ED8 C0000000005F7F40 C000000000606500
[444107.815617] GPR24: C0000001ECEFC498 C0000001EE79B840 C0000001EE798000 C0000003DDA91000
[444107.815639] GPR28: 0000000000000100 C0000003DDA91000 D0000000098D4EC0 0000000000000000
[444107.815661] NIP [d0000000098bf638] .poll_catas+0x68/0x2f0 [ib_mthca]
[444107.815699] LR [c000000000057b2c] .run_timer_softirq+0x15c/0x260
[444107.815717] Call Trace:
[444107.815725] [c0000001ee79b710] [c0000001ee79b7c0] 0xc0000001ee79b7c0 (unreliable)
[444107.815744] [c0000001ee79b7d0] [c000000000057b2c] .run_timer_softirq+0x15c/0x260
[444107.815764] [c0000001ee79b890] [c000000000051e68] .__do_softirq+0xe8/0x1c0
[444107.815783] [c0000001ee79b950] [c000000000051fc4] .do_softirq+0x84/0x90
[444107.815801] [c0000001ee79b9d0] [c0000000000108f0] .timer_interrupt+0xd0/0x41
0
[444107.815821] [c0000001ee79bad0] [c00000000000a2b4] decrementer_common+0xb4/0x100
[444107.815838] --- Exception: 901 at .pseries_dedicated_idle+0x104/0x280
[444107.815857]     LR = .pseries_dedicated_idle+0x1e0/0x280
[444107.815868] [c0000001ee79be90] [c00000000000f460] .cpu_idle+0x40/0x60
[444107.815886] [c0000001ee79bf00] [c000000000032fa0] .start_secondary+0x120/0x150
[444107.815905] [c0000001ee79bf90] [c00000000000ba7c] .enable_64b_mode+0x0/0x28
[444107.815922] Instruction dump:
[444107.815930] 3be00000 48000020 2fab0000 381f0001 7c1f07b4 409e0058 801d0908 7f9f0040
[444107.815955] 409c00c8 e97d08f8 7be91764 7c6b4a14 <7c001c2c> 0c000000 4c00012c 780b0020
[444107.815983]  <0>Kernel panic - not syncing: Fatal exception in interrupt
[444107.815998]




More information about the general mailing list