[openib-general] Catastrophic error detected.

Michael S. Tsirkin mst at mellanox.co.il
Wed Oct 18 13:24:00 PDT 2006


Quoting r. Ira Weiny <weiny2 at llnl.gov>:
> Subject: Catastrophic error detected.
> 
> I got the following error running with OFED 1.1 on a modified 2.6.9 RHEL4
> kernel.  Hal mentioned that there might be a catastrophic error recovery patch
> submitted since then?  I can't find a mention of that in the mailing list.  If
> possible I would like to try such a patch.
> 
> Thanks,
> Ira
> 
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0: Catastrophic error detected: unknown error
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[00]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[01]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[02]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[03]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[04]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[05]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[06]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[07]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[08]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[09]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0a]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0b]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0c]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0d]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0e]: ffffffff
> 2006-10-17 21:31:47 ib_mthca 0000:07:00.0:   buf[0f]: ffffffff

OFED 1.1 will already try to recover. But the fact that you got ffffffff
indicates its a hard error that we couldn't recover from.

-- 
MST




More information about the general mailing list