[Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs
Chris Worley
worleys at gmail.com
Sun Sep 13 07:57:03 PDT 2009
On Sat, Sep 12, 2009 at 2:18 PM, Bart Van
Assche<bart.vanassche at gmail.com> wrote:
> On Wed, Sep 9, 2009 at 8:38 PM, Vladislav Bolkhovitin <vst at vlnb.net> wrote:
>> Chris Worley, on 09/09/2009 02:29 AM wrote:
>>> [ ... ]
>>> Sep 8 22:40:39 nameme kernel: end_request: I/O error, dev sde, sector
>>> 123559464
>>> Sep 8 22:40:39 nameme kernel: device-mapper: multipath: Failing path
>>> 8:64.
>>> Sep 8 22:41:32 nameme kernel: host27: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:32 nameme kernel: host28: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:32 nameme kernel: host29: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:32 nameme kernel: host30: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:32 nameme kernel: host31: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:32 nameme kernel: host32: ib_srp: DREQ received -
>>> connection closed
>>> Sep 8 22:41:34 nameme kernel: host27: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host27: add qp_in_err timer
>>> Sep 8 22:41:34 nameme kernel: host28: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host28: add qp_in_err timer
>>> Sep 8 22:41:34 nameme kernel: host29: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host29: add qp_in_err timer
>>> Sep 8 22:41:34 nameme kernel: host30: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host30: add qp_in_err timer
>>> Sep 8 22:41:34 nameme kernel: host31: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host31: add qp_in_err timer
>>> Sep 8 22:41:34 nameme kernel: host32: ib_srp: connection closed
>>> Sep 8 22:41:34 nameme kernel: ib_srp: host32: add qp_in_err timer
>>> Sep 8 22:41:59 nameme kernel: host27: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host27: ib_srp: srp_qp_in_err_timer
>>> flushed reset - done
>>> Sep 8 22:41:59 nameme kernel: host28: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host28: ib_srp: srp_qp_in_err_timer
>>> flushed reset - done
>>> Sep 8 22:41:59 nameme kernel: host29: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host29: ib_srp: srp_qp_in_err_timer
>>> flushed reset - done
>>> Sep 8 22:41:59 nameme kernel: host30: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host30: ib_srp: srp_qp_in_err_timer
>>> flushed reset - done
>>> Sep 8 22:41:59 nameme kernel: host31: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host31: ib_srp: srp_qp_in_err_timer
>>> flushed reset - done
>>> Sep 8 22:41:59 nameme kernel: host32: ib_srp: srp_qp_in_err_timer called
>>> Sep 8 22:41:59 nameme kernel: host32: ib_srp: srp_qp_in_err_timer
>>
>> Those messages should be analyzed and can be a key.
>
> The above messages mean that the SRP initiator correctly detected that
> the SRP target closed six SRP connections. But how did you get six SRP
> connections ? Are there six HCA's in the target system or are there
> six different target systems ? And how has the device mapper been
> configured on the initiator ?
The only thing there were six of are LVM volumes. One HCA, using both
ports; 10 target drives being exported from one target.
Chris
>
> Bart.
>
More information about the general
mailing list