[ofa-general] Re: ofed-1.4.1-rc2 NFS-RDMA server crash,
Vu Pham
vuhuong at mellanox.com
Mon Mar 23 11:06:59 PDT 2009
Jon Mason wrote:
> On Fri, Mar 20, 2009 at 04:17:56PM -0700, Vu Pham wrote:
>
>> Hi Jon,
>>
>> I ran connectathon test -N100 and get this crash on the server. Both
>> server/client are RHEL 5.2 x64 with connectX HCAs
>>
>> Should I open a bug# on bugzilla?
>>
>
> If you hit a bug, you should open one in bugzilla so it can be tracked.
>
OK - I just open bug #1571
> Do you see the same behavior on mainline or is this isolated to the
> RHEL5.2 backport?
>
I run server on mainline kernel 2.6.27. The server fail at same place;
however, it does not crash
general protection fault: 0000 [1] svcrdma: error fast registering xdr
for xprt
ffff81022e4f0c00SMP
thanks,
-vu
> Thanks,
> Jon
>
>
>> thanks,
>> -vu
>>
>
>
>> Mar 20 10:05:34 vlab-007 kernel: EXT3-fs: mounted filesystem with ordered data m
>> ode.
>> general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt
>> ffff81022e4f0c00SMP
>> last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus
>> CPU 4
>> Modules linked in: svcrdma(U) nfsd(U) lockd(U) nfs_acl(U) auth_rpcgss(U) exportf
>> s(U) autofs4 hidp rfcomm l2cap bluetooth sunrpc(U) rdma_ucm(U) rdma_cm(U) iw_cm(
>> U) ib_addr(U) ib_ipoib(U) ipoib_helper(U) ib_cm(U) ib_sa(U) ipv6 xfrm_nalgo cryp
>> to_api ib_uverbs(U) ib_umad(U) mlx4_ib(U) dm_mirror dm_multipath dm_mod raid0 vi
>> deo sbs backlight i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc
>> lp parport i2c_i801 mlx4_core(U) e1000e serio_raw pcspkr ib_mthca(U) shpchp i2c_
>> core ib_mad(U) ib_core(U) sg ata_piix libata mptsas mptscsih mptbase scsi_transp
>> ort_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
>> Pid: 0, comm: swapper Tainted: G 2.6.18-92.el5 #1
>> RIP: 0010:[<ffffffff80149991>] [<ffffffff80149991>] mark_clean+0x50/0x77
>> RSP: 0018:ffff81022fc6fe18 EFLAGS: 00010202
>> RAX: 5b98396687b9ba94 RBX: ffff81022349d0c0 RCX: 0000000000000080
>> RDX: 0000140d41402000 RSI: 0140d41402000000 RDI: 0140551402001000
>> RBP: 0140d41402000000 R08: 0140551402001000 R09: 5b98396687b9be94
>> R10: ffff81022fd68038 R11: ffffffff800928d3 R12: ffff81022e4f0c00
>> R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff803c82e0
>> FS: 0000000000000000(0000) GS:ffff81022fc20d40(0000) knlGS:0000000000000000
>> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
>> CR2: 00002b5f92237000 CR3: 0000000215470000 CR4: 00000000000006e0
>> Process swapper (pid: 0, threadinfo ffff81022fc66000, task ffff81022fc21080)
>> Stack: ffffffff885a45da ffff81022e4f0e60 ffff81022e4f0c00 ffff8102150d6140
>> ffff8102152ca1c0 ffff81022c7e6600 ffffffff885a4a8f ffff8102150d6140
>> ffffffff00000004 0000000000000032 ffff81022a338200 ffff81022a338af0
>> Call Trace:
>> <IRQ> [<ffffffff885a45da>] :svcrdma:svc_rdma_put_frmr+0xbc/0x117
>> [<ffffffff885a4a8f>] :svcrdma:sq_cq_reap+0x11a/0x1a8
>> [<ffffffff80064a81>] _spin_lock_bh+0x9/0x14
>> [<ffffffff885a53f8>] :svcrdma:dto_tasklet_func+0x13a/0x17a
>> [<ffffffff8821238d>] :mlx4_core:mlx4_eq_int+0x27e/0x28f
>> [<ffffffff800928d3>] tasklet_action+0x62/0xac
>> [<ffffffff80011ed2>] __do_softirq+0x5e/0xd6
>> [<ffffffff801549f5>] end_msi_irq_w_maskbit+0xf/0x1c
>> [<ffffffff8005e2fc>] call_softirq+0x1c/0x28
>> [<ffffffff8006c571>] do_softirq+0x2c/0x85
>> [<ffffffff8006c3f9>] do_IRQ+0xec/0xf5
>> [<ffffffff80056c64>] mwait_idle+0x0/0x4a
>> [<ffffffff8005d615>] ret_from_intr+0x0/0xa
>> <EOI> [<ffffffff80056c9a>] mwait_idle+0x36/0x4a
>> [<ffffffff80048b1d>] cpu_idle+0x95/0xb8
>> [<ffffffff80076667>] start_secondary+0x45a/0x469
>>
>>
>> Code: 49 8b 01 48 6b d2 38 48 83 e0 fc 48 01 d0 f0 0f ba 28 09 49
>> RIP [<ffffffff80149991>] mark_clean+0x50/0x77
>> RSP <ffff81022fc6fe18>
>>
>
>
More information about the general
mailing list