[ewg] RDS problematic on RC2

Olaf Kirch olaf.kirch at oracle.com
Wed Jan 16 23:55:50 PST 2008


On Thursday 17 January 2008 04:15, Johann George wrote:
> We've been testing the OFED 1.3 pre-releases on a 12 node cluster here
> at UNH-IOL.  RDS seemed largely functional (other than problems we
> were aware of) on OFED 1.3 RC1.  When we installed RC2, RDS stopped
> working.  A dmesg indicates the following message repeatedly on the

Huh, scary. It works reasonably well here, though.

> console:
> 
> RDS/IB: completion on 10.1.1.205 had status 9, disconnecting and reconnecting

That's a remote invalid request error. Were you testing with
RDMA or without? What user application were you using for testing?
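
For reference, here is a minimal sketch of how the numeric status in that
message maps to a name. It assumes the ordering of the work-completion
status enum in the verbs headers (enum ib_wc_status in the kernel, enum
ibv_wc_status in libibverbs). The sketch_* names are hypothetical and local
to this example, so check the values against your own headers.

```c
/* Leading entries of the work-completion status enum, in the order
 * assumed from the verbs headers. Only the first 14 values are listed;
 * everything else is reported as "unknown". */
enum sketch_wc_status {
    SKETCH_WC_SUCCESS = 0,
    SKETCH_WC_LOC_LEN_ERR,
    SKETCH_WC_LOC_QP_OP_ERR,
    SKETCH_WC_LOC_EEC_OP_ERR,
    SKETCH_WC_LOC_PROT_ERR,
    SKETCH_WC_WR_FLUSH_ERR,
    SKETCH_WC_MW_BIND_ERR,
    SKETCH_WC_BAD_RESP_ERR,
    SKETCH_WC_LOC_ACCESS_ERR,
    SKETCH_WC_REM_INV_REQ_ERR,   /* 9: the status in the dmesg line */
    SKETCH_WC_REM_ACCESS_ERR,
    SKETCH_WC_REM_OP_ERR,
    SKETCH_WC_RETRY_EXC_ERR,
    SKETCH_WC_RNR_RETRY_EXC_ERR
};

/* Translate a numeric completion status into a short description. */
const char *sketch_wc_status_name(int status)
{
    switch (status) {
    case SKETCH_WC_SUCCESS:           return "success";
    case SKETCH_WC_LOC_LEN_ERR:       return "local length error";
    case SKETCH_WC_LOC_QP_OP_ERR:     return "local QP operation error";
    case SKETCH_WC_LOC_EEC_OP_ERR:    return "local EE context operation error";
    case SKETCH_WC_LOC_PROT_ERR:      return "local protection error";
    case SKETCH_WC_WR_FLUSH_ERR:      return "work request flushed error";
    case SKETCH_WC_MW_BIND_ERR:       return "memory window bind error";
    case SKETCH_WC_BAD_RESP_ERR:      return "bad response error";
    case SKETCH_WC_LOC_ACCESS_ERR:    return "local access error";
    case SKETCH_WC_REM_INV_REQ_ERR:   return "remote invalid request error";
    case SKETCH_WC_REM_ACCESS_ERR:    return "remote access error";
    case SKETCH_WC_REM_OP_ERR:        return "remote operation error";
    case SKETCH_WC_RETRY_EXC_ERR:     return "transport retry counter exceeded";
    case SKETCH_WC_RNR_RETRY_EXC_ERR: return "RNR retry counter exceeded";
    default:                          return "unknown";
    }
}
```

Calling sketch_wc_status_name(9) returns "remote invalid request error",
which is the reading given above for "status 9".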

> Note that this is using RDS over IB.  Our minimal experience with the
> non-IB version of RDS was worse.  We only tried it with RC1 and it
> crashed one of the two machines almost instantly.

Yes, the TCP part of RDS isn't getting much maintenance, unfortunately.

Olaf
-- 
Olaf Kirch  |  --- o --- Nous sommes du soleil we love when we play
okir at lst.de |    / | \   sol.dhoop.naytheet.ah kin.ir.samse.qurax


