[openib-general][patch review] srp: fmr implementation,

Vu Pham vuhuong at mellanox.com
Fri Apr 21 09:20:24 PDT 2006


Hi Roland,

> 
> I reported the error in my original email responding to your fmr 
> patch. On an ia64 system with a PCI-X HCA I got the async event 
> IB_EVENT_QP_ACCESS_ERR at the initiator (and a CQE with 
> IB_COMPLETION_STATUS_REMOTE_ACCESS_ERROR status at my target).
> I still do not have an IB analyzer trace (as you suggested).

I still do not have the IB analyzer trace.

> 
>>
>> So the SCSI midlayer times out commands and tries to abort them.  But
>> we have no connection so the abort fails.  The SCSI command shouldn't
>> get freed now (at least if I'm understanding scsi_error.c correctly).
>>
>> Then we have no .eh_device_reset_handler so everything should fall
>> through to calling our .eh_host_reset_handler without freeing any SCSI
>> commands.  And then we crash on a use-after-free of a SCSI command.
>>
>> So where is that command getting freed on us??
>>
> 
> The scsi command that is passed to the error handlers (.eh_abort_handler, 
> .eh_host_reset_handler...) is not the same as the use-after-free scsi 
> command from req->scmnd.
> 
> There is some glitch where the scsi command in req->scmnd has already 
> been freed by the scsi midlayer; however, the request is still in our 
> pending request queue.
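
A rough user-space model of the glitch described above, as a sketch only. It
is not the contents of srp_eh.patch and it is not code from ib_srp; every
name in it (model_cmnd, model_req, model_target, model_queue, model_detach,
model_reset, MODEL_REQ_COUNT) is hypothetical. The only point is that a
request has to leave the pending queue before its command is freed,
otherwise a later reset that walks the queue touches freed memory.

```c
#include <assert.h>
#include <stddef.h>

#define MODEL_REQ_COUNT 4  /* hypothetical queue depth, not an SRP constant */

/* Stand-in for struct scsi_cmnd; only the field this model needs. */
struct model_cmnd {
    int result;
};

/* Stand-in for a pending request; scmnd == NULL marks a free slot. */
struct model_req {
    struct model_cmnd *scmnd;
};

struct model_target {
    struct model_req req[MODEL_REQ_COUNT];
};

/* Record a command in the first free slot; returns the slot index or -1. */
static int model_queue(struct model_target *t, struct model_cmnd *c)
{
    int i;

    assert(c != NULL);
    for (i = 0; i < MODEL_REQ_COUNT; i++) {
        if (t->req[i].scmnd == NULL) {
            t->req[i].scmnd = c;
            return i;
        }
    }
    return -1;
}

/*
 * Forget a command before its owner frees it, so that no later walk of
 * the pending queue can touch it. Returns 1 if a request was detached,
 * 0 if the command was not pending.
 */
static int model_detach(struct model_target *t, const struct model_cmnd *c)
{
    int i;

    for (i = 0; i < MODEL_REQ_COUNT; i++) {
        if (t->req[i].scmnd == c && c != NULL) {
            t->req[i].scmnd = NULL;
            return 1;
        }
    }
    return 0;
}

/*
 * Reset path: fail every command that is still pending and empty the
 * queue. Returns the number of commands touched. If a freed command were
 * still in the queue, the write to scmnd->result below would be the
 * use-after-free.
 */
static int model_reset(struct model_target *t, int error)
{
    int i, touched = 0;

    for (i = 0; i < MODEL_REQ_COUNT; i++) {
        if (t->req[i].scmnd != NULL) {
            t->req[i].scmnd->result = error;
            t->req[i].scmnd = NULL;
            touched++;
        }
    }
    return touched;
}
```

In this model, calling model_detach() for a command before it is released
means a following model_reset() only touches the commands that are still
valid. Skipping the detach leaves a stale pointer in the queue, which is the
situation described above.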

With the attached patch (srp_eh.patch) applied, my ia64 system does not crash anymore.

I prepared this patch by diffing against srp revision 6455 with the 
srp-params.patch that I sent you last week applied.

Please let me know if you want this patch generated against the current srp 
(revision 6550).

What is the status of srp-params.patch (introducing tuned parameters)?

Thanks,
Vu


-------------- next part --------------
A non-text attachment was scrubbed...
Name: srp_eh.patch
Type: text/x-patch
Size: 1673 bytes
Desc: not available
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20060421/0b850167/attachment.bin>