[openib-general][patch review] srp: fmr implementation,
Vu Pham
vuhuong at mellanox.com
Thu Apr 13 11:38:00 PDT 2006
Hi Roland,
>> Apr 7 18:17:17 lab105 kernel: Unable to handle kernel paging request at virtual address 6b6b6b6b6b6b6b6b
>
> I think I fixed the bug causing this oops (I was able to reproduce it,
> and I don't see it any more). I checked the following patch in and
> queued it for kernel 2.6.17:
>
My ia64 system still crashes with the patch applied. Please see log below
Apr 13 13:10:21 lab105 kernel: Abort for req_index 1
Apr 13 13:10:26 lab105 kernel: ib_srp: SRP reset_host called
Apr 13 13:10:28 lab105 kernel: ib_srp: connection closed
Apr 13 13:10:28 lab105 kernel: Unable to handle kernel paging request at
virtual address 6b6b6b6b6b6b6b6b
Apr 13 13:10:28 lab105 kernel: scsi_eh_2[13324]: Oops 11012296146944 [1]
Apr 13 13:10:28 lab105 kernel: Modules linked in: ib_srp ib_cm ib_sa
evdev joydev sg st sr_mod ide_cd cdrom usbserial parport_pc lp parport
ipv6 thermal processor fan button binfmt_misc usbhid ib_mthca ib_mad
ib_core ehci_hcd uhci_hcd usbcore i2c_i801 i2c_core e1000 nls_iso8859_1
nls_cp437 dm_mod reiserfs mptspi scsi_transport_spi mptscsih mptbase
sd_mod scsi_mod
Apr 13 13:10:28 lab105 kernel:
Apr 13 13:10:28 lab105 kernel: Pid: 13324, CPU 1, comm: scsi_eh_2
Apr 13 13:10:28 lab105 kernel: psr : 0000121008026018 ifs :
800000000000050d ip : [<a00000020235a0f1>] Not tainted
Apr 13 13:10:28 lab105 kernel: ip is at srp_reconnect_target+0x2b1/0x5c0
[ib_srp]
Apr 13 13:10:28 lab105 kernel: unat: 0000000000000000 pfs :
000000000000050d rsc : 0000000000000003
Apr 13 13:10:28 lab105 kernel: rnat: 0000000000000000 bsps:
0000000000000000 pr : 0000000000009541
Apr 13 13:10:28 lab105 kernel: ldrs: 0000000000000000 ccv :
0000000000000000 fpsr: 0009804c8a70433f
Apr 13 13:10:28 lab105 kernel: csd : 0000000000000000 ssd : 0000000000000000
Apr 13 13:10:28 lab105 kernel: b0 : a00000020235a060 b6 :
a000000100003320 b7 : a0000002023ddd80
Apr 13 13:10:28 lab105 kernel: f6 : 1003e6b6b6b6b6b6b6b6b f7 :
0ffdd8000000000000000
Apr 13 13:10:28 lab105 kernel: f8 : 1003e0000000000003598 f9 :
1003e0000000000000118
Apr 13 13:10:28 lab105 kernel: f10 : 1003e0000000000000000 f11 :
1003e0000000000000000
Apr 13 13:10:28 lab105 kernel: r1 : a00000020235c200 r2 :
e0000001e58f8b58 r3 : e00000018d748a40
Apr 13 13:10:28 lab105 kernel: r8 : e0000001e58f8ba8 r9 :
e0000001e58f89f8 r10 : a000000100931338
Apr 13 13:10:28 lab105 kernel: r11 : 0000000000000001 r12 :
e0000001ea8f7d00 r13 : e0000001ea8f0000
Apr 13 13:10:28 lab105 kernel: r14 : a000000100931340 r15 :
e0000001ea8f0000 r16 : 0000000000000001
Apr 13 13:10:28 lab105 kernel: r17 : 0000000000000001 r18 :
e0000001ea8f0f84 r19 : a000000100931348
Apr 13 13:10:28 lab105 kernel: r20 : ffffffffffffffff r21 :
0000000000000008 r22 : e00000000479c980
Apr 13 13:10:28 lab105 kernel: r23 : e0000001f5e7a920 r24 :
0000000000000080 r25 : e00000000479c99f
Apr 13 13:10:28 lab105 kernel: r26 : a0000002023ddd80 r27 :
e000000187d4c1e0 r28 : e000000187d4c000
Apr 13 13:10:28 lab105 kernel: r29 : e0000001f5e7a880 r30 :
e00000018d748ab8 r31 : e00000018d748a20
Apr 13 13:10:28 lab105 kernel:
Apr 13 13:10:28 lab105 kernel: Call Trace:
Apr 13 13:10:28 lab105 kernel: [<a000000100013000>] show_stack+0x80/0xa0
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7880 bsp=e0000001ea8f1308
Apr 13 13:10:28 lab105 kernel: [<a000000100013860>] show_regs+0x840/0x880
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7a50 bsp=e0000001ea8f12a8
Apr 13 13:10:28 lab105 kernel: [<a000000100035a10>] die+0x1b0/0x2e0
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7a60 bsp=e0000001ea8f1260
Apr 13 13:10:28 lab105 kernel: [<a000000100057840>]
ia64_do_page_fault+0x9a0/0xb20
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7a80 bsp=e0000001ea8f11f0
Apr 13 13:10:28 lab105 kernel: [<a00000010000bc80>]
ia64_leave_kernel+0x0/0x280
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7b30 bsp=e0000001ea8f11f0
Apr 13 13:10:28 lab105 kernel: [<a00000020235a0f0>]
srp_reconnect_target+0x2b0/0x5c0 [ib_srp]
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7d00 bsp=e0000001ea8f1188
Apr 13 13:10:28 lab105 kernel: [<a00000020235a460>]
srp_reset_host+0x60/0xa0 [ib_srp]
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7dc0 bsp=e0000001ea8f1160
Apr 13 13:10:28 lab105 kernel: [<a000000201b2f4d0>]
scsi_try_host_reset+0xd0/0x240 [scsi_mod]
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7dc0 bsp=e0000001ea8f1130
Apr 13 13:10:28 lab105 kernel: [<a000000201b320a0>]
scsi_error_handler+0x1860/0x2000 [scsi_mod]
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7dc0 bsp=e0000001ea8f1040
Apr 13 13:10:28 lab105 kernel: [<a0000001000b98e0>] kthread+0x220/0x280
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7e10 bsp=e0000001ea8f1000
Apr 13 13:10:28 lab105 kernel: [<a000000100011440>]
kernel_thread_helper+0xe0/0x100
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7e30 bsp=e0000001ea8f0fd0
Apr 13 13:10:28 lab105 kernel: [<a000000100009140>]
start_kernel_thread+0x20/0x40
Apr 13 13:10:28 lab105 kernel:
sp=e0000001ea8f7e30 bsp=e0000001ea8f0fd0
Apr 13 13:10:35 lab105 kernel: <3>Slab corruption:
start=e0000001e58f89f8, len=448
Apr 13 13:10:35 lab105 kernel: Redzone: 0x5a2cf071/0x5a2cf071.
Apr 13 13:10:35 lab105 kernel: Last user:
[<a000000201b289f0>](scsi_put_command+0x150/0x1c0 [scsi_mod])
Apr 13 13:10:35 lab105 kernel: 1b0: 00 00 08 00 6b 6b 6b 6b 6b 6b 6b 6b
6b 6b 6b a5
Apr 13 13:10:35 lab105 kernel: Prev obj: start=e0000001e58f8820, len=448
Apr 13 13:10:35 lab105 kernel: Redzone: 0x5a2cf071/0x5a2cf071.
Apr 13 13:10:35 lab105 kernel: Last user:
[<a000000201b289f0>](scsi_put_command+0x150/0x1c0 [scsi_mod])
Apr 13 13:10:35 lab105 kernel: 000: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
6b 6b 6b 6b
Apr 13 13:10:35 lab105 kernel: 010: 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b 6b
6b 6b 6b 6b
More information about the general
mailing list