[ofa-general] Re: ofed-1.4.1-rc2 NFS-RDMA server crash,

Vu Pham vuhuong at mellanox.com
Mon Mar 23 11:06:59 PDT 2009


Jon Mason wrote:
> On Fri, Mar 20, 2009 at 04:17:56PM -0700, Vu Pham wrote:
>   
>> Hi Jon,
>>
>> I ran connectathon test -N100 and get this crash on the server. Both  
>> server/client are RHEL 5.2 x64 with connectX HCAs
>>
>> Should I open a bug# on bugzilla?
>>     
>
> If you hit a bug, you should open one in bugzilla so it can be tracked.
>   

OK - I just open bug #1571
> Do you see the same behavior on mainline or is this isolated to the
> RHEL5.2 backport?
>   

I run server on mainline kernel 2.6.27. The server fail at same place; 
however, it does not crash

general protection fault: 0000 [1] svcrdma: error fast registering xdr 
for xprt
ffff81022e4f0c00SMP

thanks,
-vu

   
> Thanks,
> Jon
>
>   
>> thanks,
>> -vu
>>     
>
>   
>> Mar 20 10:05:34 vlab-007 kernel: EXT3-fs: mounted filesystem with ordered data m
>> ode.                                                                            
>> general protection fault: 0000 [1] svcrdma: error fast registering xdr for xprt 
>> ffff81022e4f0c00SMP                                                             
>> last sysfs file: /devices/pci0000:00/0000:00:00.0/local_cpus                    
>> CPU 4                                                                           
>> Modules linked in: svcrdma(U) nfsd(U) lockd(U) nfs_acl(U) auth_rpcgss(U) exportf
>> s(U) autofs4 hidp rfcomm l2cap bluetooth sunrpc(U) rdma_ucm(U) rdma_cm(U) iw_cm(
>> U) ib_addr(U) ib_ipoib(U) ipoib_helper(U) ib_cm(U) ib_sa(U) ipv6 xfrm_nalgo cryp
>> to_api ib_uverbs(U) ib_umad(U) mlx4_ib(U) dm_mirror dm_multipath dm_mod raid0 vi
>> deo sbs backlight i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc 
>> lp parport i2c_i801 mlx4_core(U) e1000e serio_raw pcspkr ib_mthca(U) shpchp i2c_
>> core ib_mad(U) ib_core(U) sg ata_piix libata mptsas mptscsih mptbase scsi_transp
>> ort_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd                     
>> Pid: 0, comm: swapper Tainted: G      2.6.18-92.el5 #1                          
>> RIP: 0010:[<ffffffff80149991>]  [<ffffffff80149991>] mark_clean+0x50/0x77       
>> RSP: 0018:ffff81022fc6fe18  EFLAGS: 00010202                                    
>> RAX: 5b98396687b9ba94 RBX: ffff81022349d0c0 RCX: 0000000000000080               
>> RDX: 0000140d41402000 RSI: 0140d41402000000 RDI: 0140551402001000               
>> RBP: 0140d41402000000 R08: 0140551402001000 R09: 5b98396687b9be94               
>> R10: ffff81022fd68038 R11: ffffffff800928d3 R12: ffff81022e4f0c00               
>> R13: 0000000000000000 R14: 0000000000000000 R15: ffffffff803c82e0               
>> FS:  0000000000000000(0000) GS:ffff81022fc20d40(0000) knlGS:0000000000000000    
>> CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b                               
>> CR2: 00002b5f92237000 CR3: 0000000215470000 CR4: 00000000000006e0               
>> Process swapper (pid: 0, threadinfo ffff81022fc66000, task ffff81022fc21080)    
>> Stack:  ffffffff885a45da ffff81022e4f0e60 ffff81022e4f0c00 ffff8102150d6140     
>>  ffff8102152ca1c0 ffff81022c7e6600 ffffffff885a4a8f ffff8102150d6140            
>>  ffffffff00000004 0000000000000032 ffff81022a338200 ffff81022a338af0            
>> Call Trace:                                                                     
>>  <IRQ>  [<ffffffff885a45da>] :svcrdma:svc_rdma_put_frmr+0xbc/0x117              
>>  [<ffffffff885a4a8f>] :svcrdma:sq_cq_reap+0x11a/0x1a8                           
>>  [<ffffffff80064a81>] _spin_lock_bh+0x9/0x14                                    
>>  [<ffffffff885a53f8>] :svcrdma:dto_tasklet_func+0x13a/0x17a                     
>>  [<ffffffff8821238d>] :mlx4_core:mlx4_eq_int+0x27e/0x28f                        
>>  [<ffffffff800928d3>] tasklet_action+0x62/0xac                                  
>>  [<ffffffff80011ed2>] __do_softirq+0x5e/0xd6                                    
>>  [<ffffffff801549f5>] end_msi_irq_w_maskbit+0xf/0x1c                            
>>  [<ffffffff8005e2fc>] call_softirq+0x1c/0x28                                    
>>  [<ffffffff8006c571>] do_softirq+0x2c/0x85                                      
>>  [<ffffffff8006c3f9>] do_IRQ+0xec/0xf5                                          
>>  [<ffffffff80056c64>] mwait_idle+0x0/0x4a                                       
>>  [<ffffffff8005d615>] ret_from_intr+0x0/0xa                                     
>>  <EOI>  [<ffffffff80056c9a>] mwait_idle+0x36/0x4a                               
>>  [<ffffffff80048b1d>] cpu_idle+0x95/0xb8                                        
>>  [<ffffffff80076667>] start_secondary+0x45a/0x469                               
>>                                                                                 
>>                                                                                 
>> Code: 49 8b 01 48 6b d2 38 48 83 e0 fc 48 01 d0 f0 0f ba 28 09 49               
>> RIP  [<ffffffff80149991>] mark_clean+0x50/0x77                                  
>>  RSP <ffff81022fc6fe18>
>>     
>
>   




More information about the general mailing list