[openfabrics-ewg] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze
Michael S. Tsirkin
mst at mellanox.co.il
Wed Jan 17 06:04:56 PST 2007
> Quoting Or Gerlitz <ogerlitz at voltaire.com>:
> Subject: Re: [openfabrics-ewg] Minutes for January 15, 2007 teleconference about OFED 1.2 development progress toward code freeze
>
> Tziporet Koren wrote:
> > Or Gerlitz wrote:
> >> Tziporet Koren wrote:
> >>
> >>
> >> The bonding package would support: fresh (2.6.20) and some older
> >> upstream kernels along with SLES10 and RH4 Ux (x=3 for sure)
> >>
> >>
> > OK - please send us all the info once its ready
> >>> General changes to the package:
> >>> * Multicast - we wait for Voltaire and Sean to close all technical
> >>> details - should be ready by the end of the week
> >>>
> >>
> >> I have just sent Sean over the list a clarification email, if needed
> >> we would be able to help doing the missing patches and i guess in a
> >> combined effort this would be ready for the end of --next-- week
> >>
> >>
> > Thanks - please work with MST & Vlad on integration
> >> what about the host side QoS code? i did not see an newer RFC nor
> >> patch other then the RFC that was sent many months ago.
>
> > We are going to update our low level driver (mthca) to support it.
>
> > Beside there should be a small change in CMA for this, and its specified
> > in the RFC.
>
> I understand that the change involves letting the rdma cm know the SID
> when the consumer calls --rdma_resolve_route-- where today it get to
> know the SID when the consumer calls --rdma_connect-- . So this is not
> an internal RDMA CM change but rather also changes the API.
>
> Same for SRP as the api of ib_sa_path_rec_get (that is the structure it
> gets as input) changes, the SRP code also changes.
>
> Any, can you send the mthca and rdmacm/rdmacm-consumers changes as
> RFC/PATCH over the list before the actual code freeze???
I didn't start on this code yet, but it does not look like a
huge project, I hope to post code by next week.
To avoid major disruptions all over the stack, my preference for OFED 1.2
would be to add new API calls and a module option (off by default) for cma/srp
to use them.
> As for the QoS RFC
> (http://openib.org/pipermail/openib-general/2006-May/022331.html) sent
> by Eitan, one design issue I see there is how to deal with IB ULPs which
> do --not-- have a well known SID. So they call ib_cm_listen with
> IB_CM_ASSIGN_SERVICE_ID and get from the CM a service id to use, then
> they might do some out of band exchange of this SID before starting
> their connection establishment.
>
> from include/rdma/ib_cm.h
>
> > * @service_id: Service identifier matched against incoming connection
> > * and service ID resolution requests. The service ID should be specified
> > * network-byte order. If set to IB_CM_ASSIGN_SERVICE_ID, the CM will
> > * assign a service ID to the caller.
>
> Typically this happens with MPI up to the extent that different ranks
> within the same job may get a different SID. One solution i was thinking
> of is to
>
> +1 define --range-- (eg big enough to serve 1024 CM consumers) per ULP
> +2 change the CM to support allocating SID in a range
> +3 change the ULPs which use IB_CM_ASSIGN_SERVICE_ID to ask SID in the
> relevant range
> +4 change the QoS manager at the SM side to support ranges
For OFED 1.2, I only planned to implement this for SDP and SRP.
I do not expect all this to be mergeable in 2.6.21 time frame,
so maybe that's enough.
So I think I'll opt for an easier
+5 don't set SID in path record query for userspace apps
--
MST
More information about the ewg
mailing list