[ofa-general] rdma_cm kernel Oops

Vladimir Sokolovsky vlad at mellanox.co.il
Tue Jun 5 05:14:47 PDT 2007


Hi Sean,
I got the following kernel oops while testing RDS HA (kernel 2.6.20):

rdma_destroy_id+0x124/0x193 corresponds to the line 778 in drivers/infiniband/core/cma.c

    771 static void cma_release_port(struct rdma_id_private *id_priv)
    772 {
    773         struct rdma_bind_list *bind_list = id_priv->bind_list;
    774
    775         if (!bind_list)
    776                 return;
    777
    778         mutex_lock(&lock);
    779         hlist_del(&id_priv->node);
    780         if (hlist_empty(&bind_list->owners)) {
    781                 idr_remove(bind_list->ps, bind_list->port);
    782                 kfree(bind_list);
    783         }
    784         mutex_unlock(&lock);
    785 }


Oops:
Jun[  645.944058] Pid: 7354, comm: rdma_cm_wq Not tainted 2.6.20 #2
  5 09:11:48 sw1[  645.944061] RIP: 0010:[<ffffffff8819aa7c>]  [<ffffffff8819aa7c>] :rdma_cm:rdma_destroy_id+0x124/0x193
23 kernel: [  64[  645.944072] RSP: 0018:ffff81011f223e30  EFLAGS: 00010206
5.816913] rds_sh[  645.944076] RAX: 0000000000100100 RBX: ffff81011d86d340 RCX: ffff8101224d0350
utdown_worker: w[  645.944080] RDX: 0000000000200200 RSI: 0000000000000056 RDI: ffffffff881a2140
as_conn 0 was_co[  645.944084] RBP: ffff8101224d0270 R08: 0000000000000000 R09: 0000000000000000
nning -1
[  645.944087] R10: ffff81011f223d50 R11: 0000000000000048 R12: 0000000000000001
[  645.944091] R13: 0000000000000287 R14: ffffffff8819b445 R15: 0000000000000000
[  645.944095] FS:  0000000000000000(0000) GS:ffffffff8058e000(0000) knlGS:0000000000000000
[  645.944099] CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
[  645.944103] CR2: 0000000000200200 CR3: 000000011e21d000 CR4: 00000000000006e0
[  645.944107] Process rdma_cm_wq (pid: 7354, threadinfo ffff81011f222000, task ffff810117df8830)
[  645.944110] Stack:  ffff8101224d0270 ffff8101224d0270 ffff81011a850a20 ffffffff8819b4a7
[  645.944119]  ffff81011a850a28 ffff81011d89ea48 ffff81011a850a20 ffffffff80239c4e
[  645.944126]  ffff81011d89ea48 ffffffff80239ced ffff8101201c7d98 00000000fffffffc
[  645.944132] Call Trace:
[  645.944143]  [<ffffffff8819b4a7>] :rdma_cm:cma_work_handler+0x62/0x6e
[  645.944153]  [<ffffffff80239c4e>] run_workqueue+0xa5/0x144
[  645.944159]  [<ffffffff80239ced>] worker_thread+0x0/0x165
[  645.944164]  [<ffffffff8023cc58>] keventd_create_kthread+0x0/0x6a
[  645.944169]  [<ffffffff80239e1c>] worker_thread+0x12f/0x165
[  645.944177]  [<ffffffff80225003>] default_wake_function+0x0/0xe
[  645.944184]  [<ffffffff80225003>] default_wake_function+0x0/0xe
[  645.944190]  [<ffffffff8023cc2f>] kthread+0xc8/0xf1
[  645.944198]  [<ffffffff8020a2b8>] child_rip+0xa/0x12
[  645.944203]  [<ffffffff8023cc58>] keventd_create_kthread+0x0/0x6a
[  645.944213]  [<ffffffff8023cb67>] kthread+0x0/0xf1
[  645.944217]  [<ffffffff8020a2ae>] child_rip+0x0/0x12
[  645.944221] 
[  645.944223] 
[  645.944224] Code: 48 89 02 74 04 48 89 50 08 48 c7 85 e0 00 00 00 00 01 10 00 
[  645.944236] RIP  [<ffffffff8819aa7c>] :rdma_cm:rdma_destroy_id+0x124/0x193
[  645.944246]  RSP <ffff81011f223e30>
[  645.944249] CR2: 0000000000200200
[  645.944251]  <4>created cm id ffff8101224d0270 for conn ffff81011c857d48

-- 
Vladimir Sokolovsky <vlad at mellanox.co.il>
Mellanox Technologies Ltd.



More information about the general mailing list