[openib-general] [PATCH] librdmacm: fix bug causing failure to work with partial membership pkey
Hal Rosenstock
halr at voltaire.com
Thu Feb 22 06:45:56 PST 2007
On Thu, 2007-02-22 at 03:04, Or Gerlitz wrote:
> Sean Hefty wrote:
> >> Note that since the HCA validates the pkey in the in coming packet, no
> >> matter what the IB SW would do, partial members of a partition can't
> >> talk to each other. So the approach taken by the core/ipoib code was
> >> to just ignore the MSb in places where the code looks for the pkey
> >> --index-- and use the full member pkey when forming MGIDs. This seems
> >> fine to me.
>
> > My concern is that ib_find_cached_pkey() returns an index to a pkey that wasn't
> > the one in the search. Can this lead to a QP being configured in such a way
> > that communication with a remote QP would silently fail?
>
> My understanding is that when an IPoIB broadcast domain contains both
> partial and full members (*) attempts to communicate between two partial
> members would silently fail,
An IB multicast group _cannot_ have partial members so this never should
get far enough to where two limited members would be unable to
communicate.
-- Hal
> does this silence is something you think we
> should work to change?
>
> (*) eg when you have bunch or clients and a server or bunch of servers
> and you don't want to allow --clients-- to communicate among themselves)
>
> > I'm not against this patch, but I want to make sure that I understand the
> > issues, so we're not creating a work-around solution. The patch is against the
> > librdmacm, yet there's nothing that I see in the librdmacm that makes me think
> > it's behaving incorrectly.
>
> My thinking is that if in the end of this thread we are willing to move
> forward without changing ib_find_cached_pkey() then this patch should be
> merged.
>
> Or.
More information about the general
mailing list