[ewg] Re: [ofa-general] questions about OFED 1.2 IPoIB bonding
    Roland Dreier 
    rdreier at cisco.com
       
    Tue Apr 10 15:34:57 PDT 2007
    
    
  
 > > HCA catastrophic errors are either a hardware problem (either a
 > > transient condition like overheating, or a busted HCA), or a firmware
 > > bug.
 > 
 > Not really, since most kernel code uses the DMA MR,
 > they can easily be triggered by e.g. incorrect DMA API usage.
 > I've just seen this with the recent PPC bug.
Out of curiosity, why does this cause a catastrophic error?  I would
have thought a work request with a bogus bus address would generate an
affiliated error, since you know exactly which resource caused the bad
transaction.
 - R.
    
    