[ofa-general] Re: lockdep question (was Re: IPoIB caused a kernel: BUG: soft lockup detected on CPU#0!)

Michael S. Tsirkin mst at mellanox.co.il
Sun Mar 11 08:04:31 PDT 2007


After adding some printks, I started getting these:

[  597.036720] BUG: MAX_STACK_TRACE_ENTRIES too low!
[  597.041546] turning off the locking correctness validator.
[  597.047135]  [<c023a922>] save_trace+0x8a/0x8f
[  597.051751]  [<c023ae8c>] mark_lock+0x65/0x3ff
[  597.056366]  [<c023a8d6>] save_trace+0x3e/0x8f
[  597.060980]  [<c023a9f0>] add_lock_to_list+0x62/0x85
[  597.066116]  [<c023b992>] __lock_acquire+0x3f4/0xbb9
[  597.071252]  [<f89da11f>] send_mad+0x79/0x103 [ib_sa]
[  597.076474]  [<c031a475>] idr_get_new_above_int+0x13c/0x216
[  597.082225]  [<c023c43d>] lock_acquire+0x56/0x71
[  597.087018]  [<f89da11f>] send_mad+0x79/0x103 [ib_sa]
[  597.092240]  [<c051ad41>] _spin_lock_irqsave+0x32/0x41
[  597.097547]  [<f89da11f>] send_mad+0x79/0x103 [ib_sa]
[  597.102770]  [<f89da11f>] send_mad+0x79/0x103 [ib_sa]
[  597.107989]  [<f89da8d9>] ib_sa_path_rec_get+0x134/0x172 [ib_sa]
[  597.114166]  [<f899b73f>] path_rec_start+0x115/0x143 [ib_ipoib]
[  597.120254]  [<f899cb38>] path_rec_completion+0x0/0x4f4 [ib_ipoib]
[  597.126610]  [<f899b874>] path_rec_create+0x77/0x9d [ib_ipoib]
[  597.132617]  [<f899c9fe>] ipoib_start_xmit+0x441/0x57b [ib_ipoib]
[  597.138888]  [<c051ae06>] _spin_unlock_irqrestore+0x34/0x39
[  597.144635]  [<c023b435>] trace_hardirqs_on+0x106/0x141
[  597.150035]  [<c04a058b>] dev_queue_xmit+0x109/0x245
[  597.155167]  [<c022ae27>] __mod_timer+0x94/0x9e
[  597.159871]  [<c04a0423>] dev_hard_start_xmit+0x1be/0x21d
[  597.165438]  [<c04a9fa9>] __qdisc_run+0xd7/0x190
[  597.170226]  [<c04a05b7>] dev_queue_xmit+0x135/0x245
[  597.175360]  [<c04ce267>] arp_process+0x2c0/0x512
[  597.180234]  [<f8954346>] mthca_tavor_interrupt+0xf3/0x12b [ib_mthca]
[  597.186855]  [<c04a088b>] netif_receive_skb+0x1c4/0x1da
[  597.192254]  [<c023b435>] trace_hardirqs_on+0x106/0x141
[  597.197648]  [<c04a0935>] process_backlog+0x94/0x107
[  597.202785]  [<c049f02b>] net_rx_action+0x9a/0x15e
[  597.207743]  [<c0227643>] __do_softirq+0x69/0xd5
[  597.212530]  [<c02276e6>] do_softirq+0x37/0x4d
[  597.217147]  [<c020617e>] do_IRQ+0x5c/0x72
[  597.221415]  [<c0204b52>] common_interrupt+0x2e/0x34
[  597.226549]  [<c02029ef>] default_idle+0x3b/0x54
[  597.231337]  [<c02029f1>] default_idle+0x3d/0x54
[  597.236124]  [<c0202aaa>] cpu_idle+0xa2/0xbb
[  597.240567]  =======================

And sometimes these:

[  404.493572] KERNEL: assertion (!timer_pending(&dev->watchdog_timer)) failed at net/sched/sch_generic.c (608)

-- 
MST



More information about the general mailing list