[ofa-general] Re: ofed-1.4.1-rc2 NFS-RDMA server crash,
Jon Mason
jon at opengridcomputing.com
Mon Mar 23 07:46:59 PDT 2009
On Fri, Mar 20, 2009 at 04:17:56PM -0700, Vu Pham wrote:
> Hi Jon,
>
> I ran connectathon test -N100 and get this crash on the server. Both
> server/client are RHEL 5.2 x64 with connectX HCAs
>
> Should I open a bug# on bugzilla?
If you hit a bug, you should open one in bugzilla so it can be tracked.
Do you see the same behavior on mainline or is this isolated to the
RHEL5.2 backport?
Thanks,
Jon
>
> thanks,
> -vu
> Mar 20 10:05:34 vlab-007 kernel: EXT3-fs: mounted filesystem with ordered data m
> ode.
> general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt
> ffff81022e4f0c00SMP
> last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus
> CPU 4
> Modules linked in: svcrdma(U) nfsd(U) lockd(U) nfs_acl(U) auth_rpcgss(U) exportf
> s(U) autofs4 hidp rfcomm l2cap bluetooth sunrpc(U) rdma_ucm(U) rdma_cm(U) iw_cm(
> U) ib_addr(U) ib_ipoib(U) ipoib_helper(U) ib_cm(U) ib_sa(U) ipv6 xfrm_nalgo cryp
> to_api ib_uverbs(U) ib_umad(U) mlx4_ib(U) dm_mirror dm_multipath dm_mod raid0 vi
> deo sbs backlight i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc
> lp parport i2c_i801 mlx4_core(U) e1000e serio_raw pcspkr ib_mthca(U) shpchp i2c_
> core ib_mad(U) ib_core(U) sg ata_piix libata mptsas mptscsih mptbase scsi_transp
> ort_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
> Pid: 0, comm: swapper Tainted: G 2.6.18-92.el5 #1
> RIP: 0010:[<ffffffff80149991>] [<ffffffff80149991>] mark_clean+0x50/0x77
> RSP: 0018:ffff81022fc6fe18 EFLAGS: 00010202
> RAX: 5b98396687b9ba94 RBX: ffff81022349d0c0 RCX: 0000000000000080
> RDX: 0000140d41402000 RSI: 0140d41402000000 RDI: 0140551402001000
> RBP: 0140d41402000000 R08: 0140551402001000 R09: 5b98396687b9be94
> R10: ffff81022fd68038 R11: ffffffff800928d3 R12: ffff81022e4f0c00
> R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff803c82e0
> FS: 0000000000000000(0000) GS:ffff81022fc20d40(0000) knlGS:0000000000000000
> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
> CR2: 00002b5f92237000 CR3: 0000000215470000 CR4: 00000000000006e0
> Process swapper (pid: 0, threadinfo ffff81022fc66000, task ffff81022fc21080)
> Stack: ffffffff885a45da ffff81022e4f0e60 ffff81022e4f0c00 ffff8102150d6140
> ffff8102152ca1c0 ffff81022c7e6600 ffffffff885a4a8f ffff8102150d6140
> ffffffff00000004 0000000000000032 ffff81022a338200 ffff81022a338af0
> Call Trace:
> <IRQ> [<ffffffff885a45da>] :svcrdma:svc_rdma_put_frmr+0xbc/0x117
> [<ffffffff885a4a8f>] :svcrdma:sq_cq_reap+0x11a/0x1a8
> [<ffffffff80064a81>] _spin_lock_bh+0x9/0x14
> [<ffffffff885a53f8>] :svcrdma:dto_tasklet_func+0x13a/0x17a
> [<ffffffff8821238d>] :mlx4_core:mlx4_eq_int+0x27e/0x28f
> [<ffffffff800928d3>] tasklet_action+0x62/0xac
> [<ffffffff80011ed2>] __do_softirq+0x5e/0xd6
> [<ffffffff801549f5>] end_msi_irq_w_maskbit+0xf/0x1c
> [<ffffffff8005e2fc>] call_softirq+0x1c/0x28
> [<ffffffff8006c571>] do_softirq+0x2c/0x85
> [<ffffffff8006c3f9>] do_IRQ+0xec/0xf5
> [<ffffffff80056c64>] mwait_idle+0x0/0x4a
> [<ffffffff8005d615>] ret_from_intr+0x0/0xa
> <EOI> [<ffffffff80056c9a>] mwait_idle+0x36/0x4a
> [<ffffffff80048b1d>] cpu_idle+0x95/0xb8
> [<ffffffff80076667>] start_secondary+0x45a/0x469
>
>
> Code: 49 8b 01 48 6b d2 38 48 83 e0 fc 48 01 d0 f0 0f ba 28 09 49
> RIP [<ffffffff80149991>] mark_clean+0x50/0x77
> RSP <ffff81022fc6fe18>
More information about the general
mailing list