[ofa-general] post_recv question
Jeff Squyres
jsquyres at cisco.com
Thu Feb 21 10:36:46 PST 2008
On Feb 21, 2008, at 10:19 AM, Gleb Natapov wrote:
>> To further complicate things, this race condition is never seen
>> _if_ the
>> application uses the same QP to advertise (send a credit allowing the
>> peer to SEND) the RECV buffer availability. So if the app posts a
>> SEND
>> after the RECV is posted and that SEND allows the peer access to the
>> RECV buffer, then everything is ok. This is due to the fact that the
>> FW/HW will process the SEND only after processing the RECV. If the
>> app
>> uses a different QP to post the SEND advertising the RECV, then the
>> race
>> condition exists allowing the peer to SEND into that RECV buffer
>> before
>> the HW makes it ready.
>>
> OpenMPI can be configured to send credit updates over different QP.
> I'll
> try to stress it next week to see what happens.
FWIW: this is exactly where the question arose: Steve's working on the
iwarp port of OMPI, and since we send the flow control messages for
all QP's between a pair of processes over a single QP, this apparent
race condition can occur.
--
Jeff Squyres
Cisco Systems
More information about the general
mailing list