[ofa-general] Recalculated Queue Sizes caused mthca Catastrophic Errors

Tziporet Koren tziporet at dev.mellanox.co.il
Wed Feb 20 23:47:03 PST 2008


Roland Dreier wrote:
>  > My code creates a CQ of size 2072, and a SQ of size 2056, and a RQ of
>  > size 16.  As you can see, CQ = SQ + RQ.  So, I should never overflow my
>  > CQ.
>  > 
>  > The Driver raises each of these to the next power of two.  So, we get a
>  > CQ of size 4096, a SQ of size 4096, and an RQ of size 16.
>  > 
>  > As you can see, CQ < SQ + RQ, so it is possible to overflow the CQ.
>  > 
>  > I don't think that this should cause the Firmware to generate a
>  > Catastrophic error (sounds like a bug in the firmware, if you ask me).
>
> Yes, as the release notes mention, it appears to be a
> hardware/firmware bug that you get a catastrophic error.  However,
> overflowing a CQ will generate a CQ overrun asynchronous error in the
> best case.
>
>   
Can you send the test you used and the FW version?
I would like to check whether we have already fixed this FW bug; if not,
we will reproduce the problem here and fix it.
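
For reference, the sizing arithmetic from the quoted report can be sketched as below. This is only an illustration, assuming the driver rounds each requested size up to the next power of two as described above; the helper names (next_pow2, cq_can_overflow) are hypothetical and are not part of mthca or libibverbs.

```python
def next_pow2(n: int) -> int:
    """Return the smallest power of two that is >= n (n must be >= 1)."""
    if n < 1:
        raise ValueError("queue size must be at least 1")
    return 1 << (n - 1).bit_length()


def cq_can_overflow(cq: int, sq: int, rq: int) -> bool:
    """True if the CQ has fewer entries than the SQ and RQ combined."""
    return cq < sq + rq


# Sizes requested by the application in the quoted report.
requested = {"cq": 2072, "sq": 2056, "rq": 16}

# Sizes after rounding up (assumed driver behaviour, per the report).
actual = {name: next_pow2(size) for name, size in requested.items()}

print("requested:", requested,
      "overflow possible:", cq_can_overflow(**requested))
print("actual:   ", actual,
      "overflow possible:", cq_can_overflow(**actual))
```

With the requested sizes the CQ exactly covers SQ + RQ, but after rounding the CQ (4096) is smaller than SQ + RQ (4096 + 16), which matches the situation described in the report.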

Thanks,
Tziporet



