[openib-general] OpenSM causes kernel trap
Jay Higley
higley at dbresearch.net
Thu Oct 27 10:28:47 PDT 2005
I am trying to start up opensm on a Dell PowerEdge 2850 with a Mellanox
based infiniband card. We are using the x86-64 Architecture. The
kernel is recompiled with the latest stack from subversion, and all of
the modules load OK. However, when I try to start opensm I get the
following error. After this, then modules can not be successfully
removed from the kernel and opensm is not successfully running. I can
send the output from opensm's log file if anyone is interested. Thanks.
-Jay Higley
Oct 27 12:07:17 riba OpenSM[3321]: OpenSM Rev:openib-1.1.0
Oct 27 12:07:17 riba kernel: Unable to handle kernel paging request at
ffffffffffffffff RIP:
Oct 27 12:07:17 riba kernel: <ffffffff8016d8db>{kfree+107}
Oct 27 12:07:17 riba kernel: PGD 103027 PUD 5619067 PMD 0
Oct 27 12:07:17 riba kernel: Oops: 0000 [1] SMP
Oct 27 12:07:17 riba kernel: CPU 3
Oct 27 12:07:17 riba kernel: Modules linked in: nfsd exportfs lockd
nfs_acl ipv6 sunrpc ib_uverbs ib_at ib_sdp ib_ucm ib_cm ib_ping ib_mthca
ib_umad binfmt_misc dm_mod video thermal processor fan container button
battery ac ehci_hcd uhci_hcd pcspkr floppy parport_pc parport ib_ipoib
ib_sa ib_mad ib_core e1000 snd_pcm_oss snd_pcm snd_timer snd_page_alloc
snd_mixer_oss snd soundcore ext3 jbd megaraid_mbox megaraid_mm sd_mod
scsi_mod
Oct 27 12:07:17 riba kernel: Pid: 1783, comm: ib_mad1 Not tainted
2.6.13.4-86.caos.smp
Oct 27 12:07:17 riba kernel: RIP: 0010:[<ffffffff8016d8db>]
<ffffffff8016d8db>{kfree+107}
Oct 27 12:07:17 riba kernel: RSP: 0018:ffff81013df97db8 EFLAGS: 00010006
Oct 27 12:07:17 riba kernel: RAX: 0000000000000003 RBX: ffffffffffffffff
RCX: ffff81013fd93518
Oct 27 12:07:17 riba kernel: RDX: 0000000000762000 RSI: 0000000000000292
RDI: ffff810004b02028
Oct 27 12:07:17 riba kernel: RBP: ffff81010e000000 R08: ffff81013df96000
R09: 0000000000000000
Oct 27 12:07:17 riba kernel: R10: 0000000000000001 R11: 00000000ffffffff
R12: ffff81013e600e10
Oct 27 12:07:17 riba kernel: R13: ffff810037deb000 R14: ffff81013e600e78
R15: ffffffff880e5190
Oct 27 12:07:17 riba kernel: FS: 0000000000000000(0000)
GS:ffffffff804f3980(0000) knlGS:0000000000000000
Oct 27 12:07:17 riba kernel: CS: 0010 DS: 0018 ES: 0018 CR0:
000000008005003b
Oct 27 12:07:17 riba kernel: CR2: ffffffffffffffff CR3: 000000013907a000
CR4: 00000000000006e0
Oct 27 12:07:17 riba kernel: Process ib_mad1 (pid: 1783, threadinfo
ffff81013df96000, task ffff81013e40a1b0)
Oct 27 12:07:17 riba kernel: Stack: 0000000000000286 ffff81013e600e10
ffff81013f3db180 ffffffff880e272e
Oct 27 12:07:17 riba kernel: ffff81013df97e28 ffffffff8817113f
ffff81013e40a3c8 ffff81013fd93500
Oct 27 12:07:17 riba kernel: ffff81013e600e00 0000000000000292
Oct 27 12:07:17 riba kernel: Call
Trace:<ffffffff880e272e>{:ib_mad:ib_free_send_mad+14}
<ffffffff8817113f>{:ib_umad:send_handler+63}
Oct 27 12:07:17 riba kernel:
<ffffffff880e5324>{:ib_mad:timeout_sends+404}
<ffffffff801348b3>{__wake_up+67}
Oct 27 12:07:17 riba kernel:
<ffffffff8014beb2>{worker_thread+498}
<ffffffff801347f0>{default_wake_function+0}
Oct 27 12:07:17 riba kernel:
<ffffffff80134840>{__wake_up_common+64}
<ffffffff801347f0>{default_wake_function+0}
Oct 27 12:07:17 riba kernel:
<ffffffff80150990>{keventd_create_kthread+0}
<ffffffff8014bcc0>{worker_thread+0}
Oct 27 12:07:17 riba kernel:
<ffffffff80150990>{keventd_create_kthread+0} <ffffffff80150949>{kthread+217}
Oct 27 12:07:17 riba kernel: <ffffffff8010f9da>{child_rip+8}
<ffffffff80150990>{keventd_create_kthread+0}
Oct 27 12:07:17 riba kernel: <ffffffff80150870>{kthread+0}
<ffffffff8010f9d2>{child_rip+0}
Oct 27 12:07:17 riba kernel:
Oct 27 12:07:17 riba kernel:
Oct 27 12:07:17 riba kernel: Code: 8b 03 3b 43 04 73 04 89 c0 eb 0a 48
89 de e8 a2 03 00 00 8b
Oct 27 12:07:17 riba kernel: RIP <ffffffff8016d8db>{kfree+107} RSP
<ffff81013df97db8>
Oct 27 12:07:17 riba kernel: CR2: ffffffffffffffff
More information about the general
mailing list