[ofa-general] Re: [PATCH v7] IB/mlx4: shrinking WQE

Michael S. Tsirkin mst at dev.mellanox.co.il
Wed Sep 19 10:14:43 PDT 2007


> Quoting Roland Dreier <rdreier at cisco.com>:
> Subject: Re: [PATCH v7] IB/mlx4: shrinking WQE
> 
>  > ConnectX supports shrinking wqe, such that a single WR can include
>  > multiple units of wqe_shift.  This way, WRs can differ in size, and
>  > do not have to be a power of 2 in size, saving memory and speeding up
>  > send WR posting.
> 
> Given this added complexity:
> 
>  6 files changed, 226 insertions(+), 39 deletions(-)
> 
> and the unpleasantness of having if (BITS_PER_LONG == 64) in various
> places,

I don't think there's a way around that.
BTW, the vmap trick is an improvement in itself;
we can easily extend it to CQs, EQs, etc.

> can you quantify the improvement this gives?

This gets me from 960 to 1020 MByte/sec on ipoib/cm with netperf.
SDP shows similar gains.
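
On the memory side, the saving can be sketched roughly as below. This is
only an illustration and not code from the patch: the helper names are
hypothetical, and the 64-byte unit (wqe_shift = 6) in the note that follows
is an assumed value.

```c
#include <stddef.h>

/* Hypothetical helpers for illustration only; not taken from the patch. */

/* Round n up to the next power of two (n > 0). */
static size_t roundup_pow2(size_t n)
{
	size_t p = 1;

	while (p < n)
		p <<= 1;
	return p;
}

/* Bytes consumed by a WR when every slot must be a power of 2 in size. */
static size_t wr_bytes_pow2(size_t wr_bytes)
{
	return roundup_pow2(wr_bytes);
}

/*
 * Bytes consumed by a WR when it may span a whole number of
 * (1 << wqe_shift)-byte units, as with shrinking WQEs.
 */
static size_t wr_bytes_shrunk(size_t wr_bytes, unsigned int wqe_shift)
{
	size_t unit = (size_t)1 << wqe_shift;

	return (wr_bytes + unit - 1) / unit * unit;
}
```

Under these assumptions a 144-byte WR takes 256 bytes with power-of-2
slots but 192 bytes (three 64-byte units) with wqe_shift = 6.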

> Would it make more sense to do this for userspace first?

Given that we want it, what does a delay buy us?

-- 
MST


