[ewg] Re: [ofa-general] questions about OFED 1.2 IPoIB bonding
Roland Dreier
rdreier at cisco.com
Tue Apr 10 15:34:57 PDT 2007
> > HCA catastrophic errors are either a hardware problem (either a
> > transient condition like overheating, or a busted HCA), or a firmware
> > bug.
>
> Not really, since most kernel code uses the DMA MR,
> they can easily be triggered by e.g. incorrect DMA API usage.
> I've just seen this with the recent PPC bug.
Out of curiosity, why does this cause a catastrophic error? I would
have thought a work request with a bogus bus address would generate an
affiliated error, since you know exactly what resource caused the bad
transaction.
- R.