[ewg] RDS problematic on RC2
Olaf Kirch
olaf.kirch at oracle.com
Wed Jan 16 23:55:50 PST 2008
On Thursday 17 January 2008 04:15, Johann George wrote:
> We've been testing the OFED 1.3 pre-releases on a 12-node cluster here
> at UNH-IOL. RDS seemed largely functional (other than problems we
> were aware of) on OFED 1.3 RC1. When we installed RC2, RDS stopped
> working. A dmesg indicates the following message repeatedly on the
Huh, scary. It works reasonably well here, though.
> console:
>
> RDS/IB: completion on 10.1.1.205 had status 9, disconnecting and reconnecting
That's a remote invalid request error. Were you testing with
RDMA or without? What user application were you using for testing?
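
For reference, here is a minimal sketch of how the numeric status in that
log line maps to a name. It assumes the number printed by RDS is the work
completion status as enumerated in the kernel's ib_wc_status (the same
numbering libibverbs uses for ibv_wc_status). The helper wc_status_name()
is hypothetical and is not part of RDS or of the verbs API; only the first
fourteen values are listed.

```c
#include <stddef.h>

/*
 * Hypothetical helper: translate a work completion status number into a
 * short description. The table order is assumed to follow the kernel's
 * enum ib_wc_status, values 0 through 13.
 */
static const char *wc_status_name(int status)
{
    static const char *const names[] = {
        "success",                          /* 0 */
        "local length error",               /* 1 */
        "local QP operation error",         /* 2 */
        "local EE context operation error", /* 3 */
        "local protection error",           /* 4 */
        "work request flushed error",       /* 5 */
        "memory window bind error",         /* 6 */
        "bad response error",               /* 7 */
        "local access error",               /* 8 */
        "remote invalid request error",     /* 9 */
        "remote access error",              /* 10 */
        "remote operation error",           /* 11 */
        "transport retry counter exceeded", /* 12 */
        "RNR retry counter exceeded",       /* 13 */
    };
    size_t count = sizeof(names) / sizeof(names[0]);

    /* Anything outside the table is reported as unknown. */
    if (status < 0 || (size_t)status >= count)
        return "unknown";
    return names[status];
}
```

Calling wc_status_name(9) with the value from the dmesg line above returns
the remote invalid request entry.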
> Note that this is using RDS over IB. Our minimal experience with the
> non-IB version of RDS was worse. We only tried it with RC1 and it
> crashed one of the two machines almost instantly.
Yes, the TCP part of RDS isn't being looked after very much, unfortunately.
Olaf
--
Olaf Kirch | --- o --- Nous sommes du soleil we love when we play
okir at lst.de | / | \ sol.dhoop.naytheet.ah kin.ir.samse.qurax