[openib-general] client-server small message performance issues

Pete Wyckoff pw at osc.edu
Mon Oct 23 16:08:25 PDT 2006


rdreier at cisco.com wrote on Tue, 17 Oct 2006 14:24 -0700:
>  > Basic ping pong is 25 us.  That's fine as this is not a particularly
>  > optimal way to communicate.  Each additional server adds 6 us.  That
>  > seems like a lot of overhead just to do another pair of posts and
>  > polls, but not my major complaint.  Look at the jump from 6 to 7
>  > servers, 41 us.  Beyond that, too.  And the standard deviation
>  > becomes huge.  A plot of the individual values shows a large spread,
>  > not just a few outliers.
> 
>  > The hardware is all Mellanox MT25204
> 
> I would guess you are seeing the effect of exceeding the size of some
> internal HCA cache, maybe the QP state cache.  But I don't know enough
> details of the HCA internals to know if this is true and if so which
> limit you're hitting.

Mellanox picked up on my email and sent me new firmware that
contains some optimizations for that particular silicon.  The
pre-release firmware image makes the numbers look much more
reasonable.

		-- Pete





More information about the general mailing list