[openib-general] ib_sa oops
Hal Rosenstock
halr at voltaire.com
Tue Nov 2 09:11:46 PST 2004
When I did a modprobe -r ib_ipoib, I got the following oops when the
SA's send_handler is called on it's deregistering it's MAD client with
pending MADs.
I first bringup and configure IPoIB:
/sbin/modprobe ib_ipoib
/sbin/ifconfig ib0 192.168.0.20
I then do:
ping -b 192.168.0.255
and ctl-C before it cycles around the list a second time and then:
/sbin/modprobe -r ib_ipoib
Segmentation fault
/var/log/messages showed:
Nov 2 10:54:17 hpc-1 kernel: Unable to handle kernel paging request at
virtual
address f8a50407
Nov 2 10:54:17 hpc-1 kernel: printing eip:
Nov 2 10:54:17 hpc-1 kernel: f8a50407
Nov 2 10:54:17 hpc-1 kernel: *pde = 019a5067
Nov 2 10:54:17 hpc-1 kernel: *pte = 00000000
Nov 2 10:54:17 hpc-1 kernel: Oops: 0000 [#1]
Nov 2 10:54:17 hpc-1 kernel: SMP
Nov 2 10:54:17 hpc-1 kernel: Modules linked in: ib_sa ib_mad
ib_services ib_mthca ib_core loop autofs e1000 ohci1394 ieee1394
parport_pc parport usbcore
Nov 2 10:54:17 hpc-1 kernel: CPU: 0
Nov 2 10:54:17 hpc-1 kernel: EIP: 0060:[<f8a50407>] Not tainted
VLI
Nov 2 10:54:17 hpc-1 kernel: EFLAGS: 00010246 (2.6.9)
Nov 2 10:54:17 hpc-1 kernel: EIP is at 0xf8a50407
Nov 2 10:54:17 hpc-1 kernel: eax: e2f05280 ebx: 00000286 ecx:
00000000 edx: fffffffb
Nov 2 10:54:17 hpc-1 kernel: esi: c6ba3340 edi: c6ba3348 ebp:
fffffffb esp: e6eebdfc
Nov 2 10:54:17 hpc-1 kernel: ds: 007b es: 007b ss: 0068
Nov 2 10:54:17 hpc-1 kernel: Process modprobe (pid: 12680,
threadinfo=e6eea000
task=f5f30230)
Nov 2 10:54:17 hpc-1 kernel: Stack: f8a217d8 fffffffb 00000000 e2f05280
e6eebe60 c02a1e5e 00000000 f5f30230
Nov 2 10:54:17 hpc-1 kernel: c0117d96 00000000 00000000 00000003
c170b060 c6ff3a70 c6ff3830 c011685a
Nov 2 10:54:17 hpc-1 kernel: f5f30230 e74b5800 f5f30230 00000000
e6eebe98 c02a1a92 c6ff3830 c170e4d0
Nov 2 10:54:17 hpc-1 kernel: Call Trace:
Nov 2 10:54:17 hpc-1 kernel: [<f8a217d8>]
ib_sa_mcmember_rec_callback+0x5a/0x7f [ib_sa]
Nov 2 10:54:17 hpc-1 kernel: [<c02a1e5e>]
wait_for_completion+0xc4/0xcc
Nov 2 10:54:17 hpc-1 kernel: [<c0117d96>]
default_wake_function+0x0/0x12
Nov 2 10:54:17 hpc-1 kernel: [<c011685a>] finish_task_switch+0x3a/0x83
Nov 2 10:54:17 hpc-1 kernel: [<c02a1a92>] schedule+0x326/0x62e
Nov 2 10:54:17 hpc-1 kernel: [<f8a21a24>] send_handler+0xaa/0xbc
[ib_sa]
Nov 2 10:54:17 hpc-1 kernel: [<f89e8642>] cancel_mads+0xe5/0x127
[ib_mad]
Nov 2 10:54:17 hpc-1 kernel: [<f89e737a>]
ib_unregister_mad_agent+0x16/0x135 [ib_mad]
Nov 2 10:54:17 hpc-1 kernel: [<c0117d96>]
default_wake_function+0x0/0x12
Nov 2 10:54:17 hpc-1 kernel: [<c0117d96>]
default_wake_function+0x0/0x12
Nov 2 10:54:17 hpc-1 kernel: [<f89f2928>] ib_get_client_data+0x42/0x4e
[ib_core]
Nov 2 10:54:17 hpc-1 kernel: [<f8a21d87>] ib_sa_remove_one+0x44/0x7d
[ib_sa]
Nov 2 10:54:17 hpc-1 kernel: [<f89f28e1>]
ib_unregister_client+0xee/0xf3 [ib_core]
Nov 2 10:54:17 hpc-1 kernel: [<c0130ecb>] try_stop_module+0x37/0x3b
Nov 2 10:54:17 hpc-1 kernel: [<c0133941>] __try_stop_module+0x0/0x41
Nov 2 10:54:17 hpc-1 kernel: [<f8a21dcf>] ib_sa_cleanup+0xf/0x13
[ib_sa]
Nov 2 10:54:17 hpc-1 kernel: [<c01310c1>]
sys_delete_module+0x16d/0x19b
Nov 2 10:54:17 hpc-1 kernel: [<c01474de>] sys_munmap+0x51/0x76
Nov 2 10:54:17 hpc-1 kernel: [<c0105cf5>] sysenter_past_esp+0x52/0x71
Nov 2 10:54:17 hpc-1 kernel: Code: Bad EIP value.
-- Hal
More information about the general
mailing list