[openib-general] scaling issues, was: uDAPL cma: add support for address and route retries, call disconnect when recving dreq

Michael S. Tsirkin mst at mellanox.co.il
Thu Nov 2 10:34:04 PST 2006


Quoting r. Sean Hefty <mshefty at ichips.intel.com>:
> Subject: scaling issues, was: uDAPL cma: add support for address and route retries, call disconnect when recving dreq
> 
> Or Gerlitz wrote:
> > Can be very nice if you share with the community the IB stack issues 
> > revealed under scale-out testing... basically what was the testbed?
> 
> We have a 256 node (512 processors) cluster that we can test with on the second 
> Tuesday following the first Monday of any month with two full moons.  We're only 
> now getting some time on the cluster, and our test capabilities are limited.
> 
> The main issue that we saw was that the SA simply doesn't scale.


We had an option to increase the RQ size for QP1 and QP0.
This might help you too: try increasing IB_MAD_QP_RECV_SIZE.


-- 
MST




More information about the general mailing list