[ofa-general] Re: ofed-1.4.1-rc2 NFS-RDMA server crash,

Jon Mason jon at opengridcomputing.com
Mon Mar 23 07:46:59 PDT 2009


On Fri, Mar 20, 2009 at 04:17:56PM -0700, Vu Pham wrote:
> Hi Jon,
>
> I ran connectathon test -N100 and get this crash on the server. Both  
> server/client are RHEL 5.2 x64 with connectX HCAs
>
> Should I open a bug# on bugzilla?

If you hit a bug, you should open one in bugzilla so it can be tracked.

Do you see the same behavior on mainline or is this isolated to the
RHEL5.2 backport?

Thanks,
Jon

>
> thanks,
> -vu

> Mar 20 10:05:34 vlab-007 kernel: EXT3-fs: mounted filesystem with ordered data m
> ode.                                                                            
> general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt 
> ffff81022e4f0c00SMP                                                             
> last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus                    
> CPU 4                                                                           
> Modules linked in: svcrdma(U) nfsd(U) lockd(U) nfs_acl(U) auth_rpcgss(U) exportf
> s(U) autofs4 hidp rfcomm l2cap bluetooth sunrpc(U) rdma_ucm(U) rdma_cm(U) iw_cm(
> U) ib_addr(U) ib_ipoib(U) ipoib_helper(U) ib_cm(U) ib_sa(U) ipv6 xfrm_nalgo cryp
> to_api ib_uverbs(U) ib_umad(U) mlx4_ib(U) dm_mirror dm_multipath dm_mod raid0 vi
> deo sbs backlight i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc 
> lp parport i2c_i801 mlx4_core(U) e1000e serio_raw pcspkr ib_mthca(U) shpchp i2c_
> core ib_mad(U) ib_core(U) sg ata_piix libata mptsas mptscsih mptbase scsi_transp
> ort_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd                     
> Pid: 0, comm: swapper Tainted: G      2.6.18-92.el5 #1                          
> RIP: 0010:[<ffffffff80149991>]  [<ffffffff80149991>] mark_clean+0x50/0x77       
> RSP: 0018:ffff81022fc6fe18  EFLAGS: 00010202                                    
> RAX: 5b98396687b9ba94 RBX: ffff81022349d0c0 RCX: 0000000000000080               
> RDX: 0000140d41402000 RSI: 0140d41402000000 RDI: 0140551402001000               
> RBP: 0140d41402000000 R08: 0140551402001000 R09: 5b98396687b9be94               
> R10: ffff81022fd68038 R11: ffffffff800928d3 R12: ffff81022e4f0c00               
> R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff803c82e0               
> FS:  0000000000000000(0000) GS:ffff81022fc20d40(0000) knlGS:0000000000000000    
> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b                               
> CR2: 00002b5f92237000 CR3: 0000000215470000 CR4: 00000000000006e0               
> Process swapper (pid: 0, threadinfo ffff81022fc66000, task ffff81022fc21080)    
> Stack:  ffffffff885a45da ffff81022e4f0e60 ffff81022e4f0c00 ffff8102150d6140     
>  ffff8102152ca1c0 ffff81022c7e6600 ffffffff885a4a8f ffff8102150d6140            
>  ffffffff00000004 0000000000000032 ffff81022a338200 ffff81022a338af0            
> Call Trace:                                                                     
>  <IRQ>  [<ffffffff885a45da>] :svcrdma:svc_rdma_put_frmr+0xbc/0x117              
>  [<ffffffff885a4a8f>] :svcrdma:sq_cq_reap+0x11a/0x1a8                           
>  [<ffffffff80064a81>] _spin_lock_bh+0x9/0x14                                    
>  [<ffffffff885a53f8>] :svcrdma:dto_tasklet_func+0x13a/0x17a                     
>  [<ffffffff8821238d>] :mlx4_core:mlx4_eq_int+0x27e/0x28f                        
>  [<ffffffff800928d3>] tasklet_action+0x62/0xac                                  
>  [<ffffffff80011ed2>] __do_softirq+0x5e/0xd6                                    
>  [<ffffffff801549f5>] end_msi_irq_w_maskbit+0xf/0x1c                            
>  [<ffffffff8005e2fc>] call_softirq+0x1c/0x28                                    
>  [<ffffffff8006c571>] do_softirq+0x2c/0x85                                      
>  [<ffffffff8006c3f9>] do_IRQ+0xec/0xf5                                          
>  [<ffffffff80056c64>] mwait_idle+0x0/0x4a                                       
>  [<ffffffff8005d615>] ret_from_intr+0x0/0xa                                     
>  <EOI>  [<ffffffff80056c9a>] mwait_idle+0x36/0x4a                               
>  [<ffffffff80048b1d>] cpu_idle+0x95/0xb8                                        
>  [<ffffffff80076667>] start_secondary+0x45a/0x469                               
>                                                                                 
>                                                                                 
> Code: 49 8b 01 48 6b d2 38 48 83 e0 fc 48 01 d0 f0 0f ba 28 09 49               
> RIP  [<ffffffff80149991>] mark_clean+0x50/0x77                                  
>  RSP <ffff81022fc6fe18>




More information about the general mailing list