[openib-general] Re: IBM utilizing and testing openIB release 1 RCx
Kevin Reilly
kjreilly at us.ibm.com
Sat Mar 25 18:20:50 PST 2006
Thanks Mike,
Certainly increasing the default send/recv queue sizes is better than
what is in the current release-1 openIB stream. I think our admins are
setting up different sizes on the file servers than on the clients. We
could just make the defaults the worst case, but it doesn't seem right
to use more memory than needed on a server. Some servers are big,
powerful file servers with lots of TCP connections, some might be 1- or
2-way blades, and some might have their storage not going over the IB
network and use IB only for MPI traffic.
What do you think about a simple module load parameter to let the admin
set the send and recv queue values to what they need on that server?
Each HCA vendor can say what maximum they support.
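To make the idea concrete, here is a rough sketch of the check a driver
might apply to an admin-supplied queue size before using it. It is
plain userspace C so it can be compiled and tried on its own; in a real
Linux module the value would typically be exposed with module_param().
Every name below (QUEUE_SIZE_MIN, sanitize_queue_size, hca_max) is
hypothetical, and rounding to a power of two is my assumption (it lets
ring indices be computed by masking), not something the current driver
is known to require.

```c
#include <assert.h>

/* Hypothetical lower bound; not taken from any real driver. */
#define QUEUE_SIZE_MIN 2u

/* Round v up to the next power of two. Only called with v below a
 * power-of-two limit, so the shift cannot overflow. */
static unsigned int round_up_pow2(unsigned int v)
{
    unsigned int p = 1;

    while (p < v)
        p <<= 1;
    return p;
}

/* Largest power of two that is <= v, for v >= 1. */
static unsigned int round_down_pow2(unsigned int v)
{
    unsigned int p = 1;

    while (p <= v / 2)
        p <<= 1;
    return p;
}

/*
 * Turn the admin's requested queue size into a usable one:
 *  - never below QUEUE_SIZE_MIN,
 *  - never above the largest power of two the HCA supports
 *    (hca_max is the vendor-reported maximum, assumed here),
 *  - otherwise rounded up to a power of two.
 */
static unsigned int sanitize_queue_size(unsigned int requested,
                                        unsigned int hca_max)
{
    unsigned int limit;

    assert(hca_max >= QUEUE_SIZE_MIN);
    limit = round_down_pow2(hca_max);

    if (requested < QUEUE_SIZE_MIN)
        requested = QUEUE_SIZE_MIN;
    if (requested >= limit)
        return limit;
    return round_up_pow2(requested);
}
```

With this, a big file server could ask for a large value and a small
blade for a small one, and an out-of-range request is clamped rather
than failing the module load. Whether to clamp silently or warn is a
separate choice this sketch does not address.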
Kevin J. Reilly
STSM, HPC Architecture
-Federation/HPS Chief Engineer
-HPC interconnect architect
(office) 845-433-7976 (tieline) 8-293-7976
From: "Michael S. Tsirkin" <mst at mellanox.co.il>
Date: 03/25/2006 12:04 PM
Please respond to: "Michael S. Tsirkin" <mst at mellanox.co.il>
To: Kevin Reilly/Poughkeepsie/IBM at IBMUS
cc: openib-general at openib.org
Subject: Re: IBM utilizing and testing openIB release 1 RCx
Quoting r. Kevin Reilly <kjreilly at us.ibm.com>:
> The biggest problem we have at the moment is being able to tune the
> IPoverIB driver's sendq_size and recvq_size. We simply can't get IP to
> work without tuning these parameters in some situations. For example,
> running our cluster filesystem file server over IPoverIB. The server
> that is a filesystem server needs to have many socket connections open
> and we use a big 8 or 16 way server for this purpose. We very quickly
> overrun the IPoverIB driver's send and receive queues. We are
> submitting a patch so we can tune these because we couldn't get things
> to work with the defaults and no way to tune.
Which queue sizes do you use?
Maybe we can just have everyone use the bigger values and be done with it.
--
Michael S. Tsirkin
Staff Engineer, Mellanox Technologies