[ofa-general] Unable to handle kernel NULL pointer dereference
Minoru Hamakawa
Minoru.Hamakawa at Sun.COM
Tue Feb 10 18:51:09 PST 2009
Hi experts,
Does anyone know the following panic??
--
Unable to handle kernel NULL pointer dereference at 0000000000000008 RIP:
[<ffffffff8003686e>] kref_get+0x1/0x3d
...
--
It occurrs when we remove IB Cable from HCA and insert cable to HCA.
The HCA is X4217A-Z(Mellanox Technologies MT25418 [ConnectX IB DDR, PCIe
2.0 2.5GT/s] (rev a0))
OFED is 1.3.1.
And Kernel is 2.6.18-92.1.10.el5_lustre.1.6.6.20081218100335smp.
#Lustre patched kernel
Thank you in advance for your kind attention.
Should you have any queries please feel free to contact me.
And I appreciate if I could hear from you at your earliest convenience.
I'm not in this alias. please reply direct to me.
Best regards,
Minoru Hamakawa
ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
Unable to handle kernel NULL pointer dereference at 0000000000000008 RIP:
[<ffffffff8003686e>] kref_get+0x1/0x3d
PGD 40a158067 PUD 40a364067 PMD 0
Oops: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:1c.0/0000:0b:00.1/irq
CPU 0
Pid: 3752, comm: ib_mad1 Tainted: GF
2.6.18-92.1.10.el5_lustre.1.6.6.20081218100335smp #1
RIP: 0010:[<ffffffff8003686e>] [<ffffffff8003686e>] kref_get+0x1/0x3d
RSP: 0018:ffff8104184f5cf0 EFLAGS: 00010002
RAX: ffff81040dcf3000 RBX: ffff81040dcf3000 RCX: 0000000000000000
RDX: 0000000000000100 RSI: ffff8104189f4dc0 RDI: 0000000000000008
RBP: ffff81040dcf3130 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff810416036828
R13: ffff8104189f4c18 R14: ffff8104189f4c00 R15: ffff8104184f7280
FS: 0000000000000000(0000) GS:ffffffff803ea000(0000) knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000008 CR3: 000000040a2f1000 CR4: 00000000000006e0
Process ib_mad1 (pid: 3752, threadinfo ffff8104184f4000, task
ffff81041b993100)
Stack: ffff81040dcf3000 ffffffff88585668 040000001d420301 032801001dbe4000
ae64ffff88432100 0000c0fe00007fff 00ba030001000000 0000001ac9cc0001
4580a0d000000000 000000d000000000 fc89ef3000000000 0000000000002ad7
Call Trace:
[<ffffffff88585668>] :ib_sa:notice_handler+0xaf/0x10b
[<ffffffff883f8fd1>] :ib_mad:ib_mad_completion_handler+0x433/0x5e0
[<ffffffff883f8b9e>] :ib_mad:ib_mad_completion_handler+0x0/0x5e0
[<ffffffff8004cd60>] run_workqueue+0x94/0xe4
[<ffffffff8004966b>] worker_thread+0x0/0x122
[<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
[<ffffffff8004975b>] worker_thread+0xf0/0x122
[<ffffffff8008acce>] default_wake_function+0x0/0xe
[<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
[<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
[<ffffffff8003243b>] kthread+0xfe/0x132
[<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
[<ffffffff8005dfb1>] child_rip+0xa/0x11
[<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
[<ffffffff8003233d>] kthread+0x0/0x132
[<ffffffff8005dfa7>] child_rip+0x0/0x11
More information about the general
mailing list