[ofa-general] [PATCH] IB/core: handle race between elements in work queues after event
Moni Shoua
monis at Voltaire.COM
Wed May 14 00:41:19 PDT 2008
Roland Dreier wrote:
> > Can we please go on with this patch? We would like to see it in the next kernel.
>
> I still don't get why this is important to you. Is there a concrete
> example of a situation where this actually makes a measurable difference?
>
> We need some justification for adding this locking complexity beyond "it
> doesn't hurt." (And also of course we need it fixed so there aren't races)
>
> - R.
Hi,
OK. Here is an example that we observed in our tests.
One IPoIB host (client) sends a stream of multicast packets to another IPoIB host (server).
An SM takeover event takes place during the traffic; as a result, multicast info is flushed
and the hosts need to rejoin. Without the patch there is a chance (in our experience,
a very high one) that the rejoin request will be sent to the old SM, and the join completes successfully only after a retry.
This takes too long, and the patch solves it.
I hope this is convincing enough for you, because for us it is important that
recovery from a failure be as quick as possible.
thanks
MoniS