[Openib-windows] duplicate socket deadlock in WSD

Yossi Leybovich sleybo at mellanox.co.il
Thu Oct 27 10:09:46 PDT 2005


Fab

I think there is deadlock in the duplicate socket flow

duplicate socket call wait_cq_drain with sock_info->mutex acquire
(ibsp_duplicate.c line 313) 
and even in the busy wait loop (wait_cq_drain function ) in its does not
release the mutex and wait to the counters to be 0.

But in the completion function (copletion_wq) in case of flush in error the
code try to acquire the mutex
so the completion function will not cont. and we are in deadlock

simple patch that solve it is to release the mutex before the sleep and
acquire it after the sleep 

10x
Yossi 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/ofw/attachments/20051027/d589efb3/attachment.html>


More information about the ofw mailing list