[ofa-general] Recalculated Queue Sizes caused mthca Catastrophic Errors
Tziporet Koren
tziporet at dev.mellanox.co.il
Wed Feb 20 23:47:03 PST 2008
Roland Dreier wrote:
> > My code creates a CQ of size 2072, and a SQ of size 2056, and a RQ of
> > size 16. As you can see, CQ = SQ + RQ. So, I should never overflow my
> > CQ.
> >
> > The Driver raises each of these to the next power of two. So, we get a
> > CQ of size 4096, a SQ of size 4096, and an RQ of size 16.
> >
> > As you can see, CQ < SQ + RQ, so it is possible to overflow the CQ.
> >
> > I don't think that this should cause the Firmware to generate a
> > Catastrophic error (sounds like a bug in the firmware, if you ask me).
>
> Yes, as the release notes mention, it appears to be a
> hardware/firmware bug that you get a catastrophic error. However,
> overflowing a CQ will generate a CQ overrun asynchronous error in the
> best case.
>
>
Can you send the test you used and the FW version?
I would like to check whether we have already fixed this FW bug; if not,
we will reproduce the problem here and fix it.
Thanks,
Tziporet
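
The queue-size arithmetic quoted above can be checked with a short sketch. It assumes only what the thread states, namely that each requested size is rounded up to the next power of two; the helper names (next_pow2, cq_can_overflow) are hypothetical and are not part of the mthca driver or any verbs API, and the real driver may apply additional adjustments.

```python
def next_pow2(n: int) -> int:
    """Return the smallest power of two that is >= n (for n >= 1)."""
    return 1 << (n - 1).bit_length()


def cq_can_overflow(cq: int, sq: int, rq: int) -> bool:
    """A CQ can overflow if it is smaller than the SQ and RQ combined."""
    return cq < sq + rq


# Sizes requested by the application, as given in the thread.
requested = {"cq": 2072, "sq": 2056, "rq": 16}

# Sizes after rounding, assuming plain round-up-to-power-of-two.
actual = {name: next_pow2(size) for name, size in requested.items()}

print("requested:", requested, "overflow possible:", cq_can_overflow(**requested))
print("actual:   ", actual, "overflow possible:", cq_can_overflow(**actual))
```

With the requested sizes the CQ exactly covers SQ + RQ (2072 = 2056 + 16), but after rounding the CQ holds 4096 entries against 4096 + 16 = 4112 possible outstanding work requests, which is the gap described in the original report.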