[ofa-general] How to recover from a bad MAD status (110) from lid 6

Chuck Baker Charles.Baker at Sun.COM
Wed Apr 8 09:35:24 PDT 2009


Thanks for the response.

It turned out to be that the target hung, and a case of new user syndrome.

All are back up and operational.

thanks
chuck

On 04/08/09 04:59, Hal Rosenstock wrote:
> On Tue, Apr 7, 2009 at 4:01 PM, Chuck Baker <Charles.Baker at sun.com> wrote:
>   
>> Hi,
>>
>> I encountered an error while running load tests on RHEL5.2 OFED 1.4
>> connected
>> to SRP targets, and am wondering how to recover.
>>
>> The error's I'm seeing is an I/O failed with EIO I/O error messages, and my
>> load
>> generator failed.
>>
>> Since the failure, the srp_daemon reports
>>
>> srp_daemon -a -o -c -n -i mthca0
>> 07/03/09 10:54:13 : bad MAD status (110) from lid 6
>>     
>
> 110 is ETIMEDOUT
>  id_ext=0003ba0001005504,ioc_guid=0003ba0001005504,dgid=fe800000000000000003ba0001005506,pkey=ffff,service_id=0003ba0001005504,initiator_ext=0655000100ba0300
>   
>> id_ext=0003ba000100575c,ioc_guid=0003ba000100575c,dgid=fe800000000000000003ba000100575e,pkey=ffff,service_id=0003ba000100575c,initiator_ext=5e57000100ba0300
>>
>> rebooting the target and then rebooting the initiator has made no
>> difference.
>>     
>
> Sounds like some sort of network issue.
>
>   
>> Any ideas on how to resolve this would be appreciated.
>>     
>
> Can you smpquery between initiator and target ? If so, what about perfquery ?
>
> Have you tried rebooting your SM ?
>
> -- Hal
>
>   
>> thanks
>> chuck
>>     
> _______________________________________________
> general mailing list
> general at lists.openfabrics.org
> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
>
> To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openfabrics.org/pipermail/general/attachments/20090408/09686f73/attachment.html>


More information about the general mailing list