[openib-general] [PATCH repost] IB/srp: destroy/recreate qp/cq at reconnect
Michael S. Tsirkin
mst at mellanox.co.il
Tue Oct 31 11:43:57 PST 2006
Quoting r. Michael S. Tsirkin <mst at mellanox.co.il>:
> Subject: [PATCH repost] IB/srp: destroy/recreate qp/cq at reconnect
>
> From: Ishai Rabinovitz <ishai at mellanox.co.il>
>
> This makes SRP more robust in presence of hardware errors
> and is closer to behaviour suggested by IB spec,
> reducing chance of stale packets.
>
> Signed-off-by: Ishai Rabinovitz <ishai at mellanox.co.il>
> Signed-off-by: Michael S. Tsirkin <mst at mellanox.co.il>
>
> ---
>
> Hello, Roland!
> What do you think about this? Please consider for 2.6.19.
>
> For some reason (could be a firmware problem) I got a CQ overrun in SRP.
> Because of that there was a QP FATAL. Since in srp_reconnect_target we are not
> destroying the QP, the QP FATAL persists after the reconnect.
> In order to be able to recover from such situation I suggest we
> destroy the CQ and the QP in every reconnect.
>
> This also corrects a minor spec in-compliance - when srp_reconnect_target
> is called, srp destroys the CM ID and resets the QP, the new connection
> will be retried with the same QPN which could theoretically lead to
> stale packets (for strict spec compliance I think QPN should not be reused
> till all stale packets are flushed out of the network).
Roland, what do you think about this patch?
Seems like a good idea, to me.
--
MST
More information about the general
mailing list