[ofa-general] Unable to handle kernel NULL pointer dereference

Minoru Hamakawa Minoru.Hamakawa at Sun.COM
Tue Feb 10 18:51:09 PST 2009


Hi experts,

Does anyone know the following panic??
--
Unable to handle kernel NULL pointer dereference at 0000000000000008 RIP:
 [<ffffffff8003686e>] kref_get+0x1/0x3d
...
--

It occurrs when we remove IB Cable from HCA and insert cable to HCA.
The HCA is X4217A-Z(Mellanox Technologies MT25418 [ConnectX IB DDR, PCIe
2.0 2.5GT/s] (rev a0))
OFED is 1.3.1.
And Kernel is 2.6.18-92.1.10.el5_lustre.1.6.6.20081218100335smp.
#Lustre patched kernel

Thank you in advance for your kind attention.
Should you have any queries please feel free to contact me.
And I appreciate if I could hear from you at your earliest convenience.

I'm not in this alias. please reply direct to me.

Best regards,
Minoru Hamakawa


ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
ib0: multicast join failed for ff12:401b:ffff:0000:0000:0000:ffff:ffff,
status -11
Unable to handle kernel NULL pointer dereference at 0000000000000008 RIP:
 [<ffffffff8003686e>] kref_get+0x1/0x3d
PGD 40a158067 PUD 40a364067 PMD 0
Oops: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:1c.0/0000:0b:00.1/irq
CPU 0
Pid: 3752, comm: ib_mad1 Tainted: GF
2.6.18-92.1.10.el5_lustre.1.6.6.20081218100335smp #1
RIP: 0010:[<ffffffff8003686e>]  [<ffffffff8003686e>] kref_get+0x1/0x3d
RSP: 0018:ffff8104184f5cf0  EFLAGS: 00010002
RAX: ffff81040dcf3000 RBX: ffff81040dcf3000 RCX: 0000000000000000
RDX: 0000000000000100 RSI: ffff8104189f4dc0 RDI: 0000000000000008
RBP: ffff81040dcf3130 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff810416036828
R13: ffff8104189f4c18 R14: ffff8104189f4c00 R15: ffff8104184f7280
FS:  0000000000000000(0000) GS:ffffffff803ea000(0000) knlGS:0000000000000000
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000000000000008 CR3: 000000040a2f1000 CR4: 00000000000006e0
Process ib_mad1 (pid: 3752, threadinfo ffff8104184f4000, task
ffff81041b993100)
Stack:  ffff81040dcf3000 ffffffff88585668 040000001d420301 032801001dbe4000
 ae64ffff88432100 0000c0fe00007fff 00ba030001000000 0000001ac9cc0001
 4580a0d000000000 000000d000000000 fc89ef3000000000 0000000000002ad7
Call Trace:
 [<ffffffff88585668>] :ib_sa:notice_handler+0xaf/0x10b
 [<ffffffff883f8fd1>] :ib_mad:ib_mad_completion_handler+0x433/0x5e0
 [<ffffffff883f8b9e>] :ib_mad:ib_mad_completion_handler+0x0/0x5e0
 [<ffffffff8004cd60>] run_workqueue+0x94/0xe4
 [<ffffffff8004966b>] worker_thread+0x0/0x122
 [<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8004975b>] worker_thread+0xf0/0x122
 [<ffffffff8008acce>] default_wake_function+0x0/0xe
 [<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8003243b>] kthread+0xfe/0x132
 [<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8005dfb1>] child_rip+0xa/0x11
 [<ffffffff8009dcac>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8003233d>] kthread+0x0/0x132
 [<ffffffff8005dfa7>] child_rip+0x0/0x11



More information about the general mailing list