[openib-general] crash in mthca soon after loading drivers
Sean Hefty
mshefty at ichips.intel.com
Wed Dec 8 14:57:20 PST 2004
I'm getting the following bug in mthca when loading the drivers (core,
mad, and mthca). The system is attached to a fabric with opensm
running on top of the Mellanox gold software stack. I hit this when
running with the tip of openib. Any help would be, well, helpful.
- Sean
Dec 8 14:53:47 mshefty-linux2 kernel: kernel BUG at
drivers/infiniband/hw/mthca/mthca_cmd.c:328!
Dec 8 14:53:47 mshefty-linux2 kernel: invalid operand: 0000 [#1]
Dec 8 14:53:47 mshefty-linux2 kernel: SMP
Dec 8 14:53:47 mshefty-linux2 kernel: Modules linked in: ib_mthca
ib_mad ib_core edd st sr_mod ide_cd cdrom thermal processor fan button
battery ac e100 mii e1000 hw_random uhci_hcd usbcore evdev reiserfs
aic7xxx sd_mod scsi_mod
Dec 8 14:53:47 mshefty-linux2 kernel: CPU: 0
Dec 8 14:53:47 mshefty-linux2 kernel: EIP:
0060:[pg0+948359147/1069220864] Not tainted VLI
Dec 8 14:53:47 mshefty-linux2 kernel: EIP: 0060:[<f8cbafeb>] Not
tainted VLI
Dec 8 14:53:47 mshefty-linux2 kernel: EFLAGS: 00010286 (2.6.9)
Dec 8 14:53:47 mshefty-linux2 kernel: EIP is at
mthca_cmd_wait+0x19b/0x1b0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: eax: f6b245a0 ebx: f6b24584
ecx: f6b24584 edx: ffffffff
Dec 8 14:53:47 mshefty-linux2 kernel: esi: 32b50000 edi: f6b24324
ebp: f6b245a0 esp: f29d7e34
Dec 8 14:53:47 mshefty-linux2 kernel: ds: 007b es: 007b ss: 0068
Dec 8 14:53:47 mshefty-linux2 kernel: Process ib_mad1 (pid: 9900,
threadinfo=f29d6000 task=f744a710)
Dec 8 14:53:47 mshefty-linux2 kernel: Stack: 00000024 32b50000
00000000 f6b24324 32b50000 00000000 0000ea60 f8cbb058
Dec 8 14:53:47 mshefty-linux2 kernel: f29d7e70 00000000
00000001 00000000 00000024 0000ea60 f29d7edb 32b50100
Dec 8 14:53:47 mshefty-linux2 kernel: 00000000 00000001
00000000 f2b50100 f2b50000 f8cbd265 32b50100 00000000
Dec 8 14:53:47 mshefty-linux2 kernel: Call Trace:
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+948359256/1069220864]
mthca_cmd_box+0x58/0x90 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8cbb058>]
mthca_cmd_box+0x58/0x90 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+948367973/1069220864]
mthca_MAD_IFC+0x85/0xf0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8cbd265>]
mthca_MAD_IFC+0x85/0xf0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [check_poison_obj+45/432]
check_poison_obj+0x2d/0x1b0
Dec 8 14:53:47 mshefty-linux2 kernel: [<c0144e4d>]
check_poison_obj+0x2d/0x1b0
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+948399983/1069220864]
mthca_process_mad+0xcf/0x1c0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8cc4f6f>]
mthca_process_mad+0xcf/0x1c0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+948399776/1069220864]
mthca_process_mad+0x0/0x1c0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8cc4ea0>]
mthca_process_mad+0x0/0x1c0 [ib_mthca]
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+946047120/1069220864]
ib_mad_recv_done_handler+0xd0/0x230 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8a86890>]
ib_mad_recv_done_handler+0xd0/0x230 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+946048724/1069220864]
ib_mad_completion_handler+0x94/0xa0 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8a86ed4>]
ib_mad_completion_handler+0x94/0xa0 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [remove_wait_queue+12/64]
remove_wait_queue+0xc/0x40
Dec 8 14:53:47 mshefty-linux2 kernel: [<c011f97c>]
remove_wait_queue+0xc/0x40
Dec 8 14:53:47 mshefty-linux2 kernel: [worker_thread+424/560]
worker_thread+0x1a8/0x230
Dec 8 14:53:47 mshefty-linux2 kernel: [<c01305f8>]
worker_thread+0x1a8/0x230
Dec 8 14:53:47 mshefty-linux2 kernel: [pg0+946048576/1069220864]
ib_mad_completion_handler+0x0/0xa0 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [<f8a86e40>]
ib_mad_completion_handler+0x0/0xa0 [ib_mad]
Dec 8 14:53:47 mshefty-linux2 kernel: [default_wake_function+0/16]
default_wake_function+0x0/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: [<c011e460>]
default_wake_function+0x0/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: [default_wake_function+0/16]
default_wake_function+0x0/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: [<c011e460>]
default_wake_function+0x0/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: [worker_thread+0/560]
worker_thread+0x0/0x230
Dec 8 14:53:47 mshefty-linux2 kernel: [<c0130450>]
worker_thread+0x0/0x230
Dec 8 14:53:47 mshefty-linux2 kernel: [kthread+136/176] kthread+0x88/0xb0
Dec 8 14:53:47 mshefty-linux2 kernel: [<c0134128>] kthread+0x88/0xb0
Dec 8 14:53:47 mshefty-linux2 kernel: [kthread+0/176] kthread+0x0/0xb0
Dec 8 14:53:47 mshefty-linux2 kernel: [<c01340a0>] kthread+0x0/0xb0
Dec 8 14:53:47 mshefty-linux2 kernel: [kernel_thread_helper+5/16]
kernel_thread_helper+0x5/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: [<c0105275>]
kernel_thread_helper+0x5/0x10
Dec 8 14:53:47 mshefty-linux2 kernel: Code: 14 d2 89 d0 c1 e0 09 29 d0
89 c2 c1 e2 12 01 d0 f7 d8 89 87 84 02 00 00 89 e8 e8 51 92 64 c7 89 da
83 c4 0c 89 d0 5b 5e 5f 5d c3 <0f> 0b 48 01 40 7b cc f8 e9 c7 fe ff ff
90 8d b4 26 00 00 00 00
More information about the general
mailing list