[openib-general] EHCA crash on module unload?

Heiko J Schick info at schihei.de
Tue Apr 11 09:59:27 PDT 2006


Hello Troy,

Did you unload all the OpenIB modules first and then the eHCA module,
or the other way around?

Do you see any other messages (error data) in /var/log/messages?

It looks like the module was unloaded while an interrupt was coming in.
Can you send us the steps / commands you executed when the panic
occurred?
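
As a rough cross-check of the oops quoted below, here is a minimal
sketch that decodes the faulting instruction word (the one in angle
brackets in the instruction dump) and compares the computed address
with the DAR. It assumes the standard PowerPC DS-form layout for
primary opcode 58 (ld/ldu/lwa); the function and variable names are
made up for illustration and are not part of the eHCA driver.

```python
def decode_ds_form(word):
    """Split a 32-bit PowerPC DS-form instruction word into its fields."""
    opcode = (word >> 26) & 0x3F   # primary opcode, top 6 bits
    rt = (word >> 21) & 0x1F       # target register
    ra = (word >> 16) & 0x1F       # base register
    disp = word & 0xFFFC           # DS field shifted left by 2
    if disp & 0x8000:              # sign-extend the 16-bit displacement
        disp -= 0x10000
    xo = word & 0x3                # extended opcode, low 2 bits
    return opcode, rt, ra, disp, xo


# Values copied from the quoted oops.
FAULTING_WORD = 0xEBBF0068         # <ebbf0068> in the instruction dump
GPR31 = 0x0000000000000000         # last value on the GPR28 line
DAR = 0x0000000000000068           # faulting data address

opcode, rt, ra, disp, xo = decode_ds_form(FAULTING_WORD)

# Opcode 58 selects ld / ldu / lwa by the extended opcode.
mnemonic = {0: "ld", 1: "ldu", 2: "lwa"}.get(xo, "?") if opcode == 58 else "?"

# Base register plus displacement (the base here is r31, so the
# "RA = 0 means literal zero" special case does not apply).
effective_address = (GPR31 + disp) & 0xFFFFFFFFFFFFFFFF

print(f"{mnemonic} r{rt},{disp}(r{ra})")
print(hex(effective_address), effective_address == DAR)
```

The word decodes as a load from offset 0x68 off r31, and r31 is zero
in the register dump, which matches the DAR of 0x68. That is what one
would expect if a structure pointer was already NULL (for example,
cleared during unload) when the interrupt handler ran, although the
decode alone does not prove that this is what happened.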

Regards,
	Heiko

On 11.04.2006, at 18:36, Troy Benjegerdes wrote:

> p5l2:/usr/src/linux-2.6.16/drivers/infiniband# svnversion .
> 5988
>
>
> p5l2:~# [86044.767087] Unable to handle kernel paging request for data
> at address 0x00000068
> [86044.767115] Faulting instruction address: 0xd000000018fd4b38
> [86044.767132] Oops: Kernel access of bad area, sig: 11 [#1]
> [86044.767149] SMP NR_CPUS=8 NUMA PSERIES LPAR
> [86044.767169] Modules linked in: ib_uverbs ib_umad ib_mad hcad_mod  
> ib_core libafs ipr sd_mod sg
> [86044.767197] NIP: D000000018FD4B38 LR: D000000018FD4AEC CTR:  
> C000000000160FF4
> [86044.767212] REGS: c0000001df42ba20 TRAP: 0300   Tainted: P  
> (2.6.16-power5)
> [86044.767225] MSR: 8000000000009032 <EE,ME,IR,DR>  CR: 22000024   
> XER: 20000020
> [86044.767270] DAR: 0000000000000068, DSISR: 0000000040000000
> [86044.767287] TASK = c0000003ad9c0dd0[2232] 'ehca/0' THREAD:  
> c0000001df428000 CPU: 0
> [86044.767303] GPR00: 0000000000000000 C0000001DF42BCA0  
> D000000018FFC350 0000000000000000
> [86044.767334] GPR04: 0000000000000001 0000000000060000  
> 0000000022000042 C000000002319460
> [86044.767366] GPR08: D000000000035208 D000000018FEF138  
> D000000000035000 0000000000000000
> [86044.767400] GPR12: D000000018FDBEA8 C000000000433C00  
> 0000000000000000 0000000000000000
> [86044.767439] GPR16: 0000000000000000 0000000000000000  
> 0000000000000000 0000000000000000
> [86044.767472] GPR20: 0000000000C00000 4000000001C10000  
> C0000000003E6AF0 0000000001FF6D50
> [86044.767493] GPR24: C000000002319490 0000000000000001  
> C000000002319458 C000000002319000
> [86044.767514] GPR28: C000000002319490 D000000018FEF3B8  
> D000000018FFA578 0000000000000000
> [86044.767537] NIP [D000000018FD4B38] .ehca_interrupt_eq 
> +0x1c4/0x550 [hcad_mod]
> [86044.767573] LR [D000000018FD4AEC] .ehca_interrupt_eq+0x178/0x550  
> [hcad_mod]
> [86044.767601] Call Trace:
> [86044.767609] [C0000001DF42BCA0]  
> [D000000018FD4A04] .ehca_interrupt_eq+0x90/0x550 [hcad_mod]  
> (unreliable)
> [86044.767643] --- Exception: 2 at .__start+0x4000000000000000/0x8
> [86044.767662]     LR = .kernel_thread+0x4c/0x68
> [86044.767673] [C0000001DF42BD50] [C0000000000561EC] .run_workqueue 
> +0xdc/0x168 (unreliable)
>
> [86044.767695] [C0000001DF42BDF0] [C0000000000564B0] .worker_thread 
> +0x128/0x198
>
> [86044.767714] [C0000001DF42BEE0] [C00000000005B450] .kthread 
> +0x120/0x170
>
> [86044.767731] [C0000001DF42BF90] [C000000000021CF8] .kernel_thread 
> +0x4c/0x68
>
> [86044.767746] Instruction dump:
> [86044.767754] 4c00012c 7c0007b4 2f80ffff 409c001c 5400043e  
> 2f800000 409e0010 7fa3eb78
> [86044.767805] 48007201 e8410028 e93e8000 3ca00006 <ebbf0068>  
> 60a50139 88090006 2b800007
> [86044.767863]



