[ofa-general] [GIT PULL] OFED 1.2 uDAPL release notes

Sean Hefty sean.hefty at intel.com
Fri Jul 6 09:48:02 PDT 2007


>Even though I force all ranks to use only the first card (ib0), it works
>for a while and then fails with NON_PEER_REJECTED when one rank tries to
>connect to another rank (dat_connect() and dat_evd_wait()). (I run a
>simple MPI job in an infinite loop; it fails after hundreds of runs.)

This sounds like it could be a race condition caused by running the test in a
loop.  If the client tries to connect before the server is listening, it will
receive this sort of reject event.
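
If that is what is happening, one common workaround is to have the client
retry a rejected connect a few times, backing off between attempts, instead of
treating the first reject as fatal.  The following is only a rough sketch of
that idea in plain C.  None of these names come from uDAPL or librdmacm:
attempt_fn, wait_fn, connect_with_retry() and the fake_* helpers are all
hypothetical.  In a real client the attempt callback would issue the connect,
wait on the EVD, and map a NON_PEER_REJECTED event to ATTEMPT_REJECTED.

```c
#include <assert.h>
#include <stddef.h>

/* Outcome of a single connection attempt (hypothetical, not a DAT type). */
enum attempt_result {
    ATTEMPT_CONNECTED = 0,  /* connection established */
    ATTEMPT_REJECTED  = 1,  /* rejected, e.g. nobody is listening yet */
    ATTEMPT_FAILED    = 2   /* hard error, retrying will not help */
};

/* Caller-supplied hooks: one connect attempt, and one delay between attempts. */
typedef enum attempt_result (*attempt_fn)(void *ctx);
typedef void (*wait_fn)(long delay_ms, void *ctx);

/*
 * Try to connect up to max_attempts times, doubling the delay after each
 * reject.  Returns the number of attempts used on success, 0 if every
 * attempt was rejected, and -1 on a hard failure.
 */
static int connect_with_retry(attempt_fn attempt, wait_fn wait, void *ctx,
                              int max_attempts, long first_delay_ms)
{
    long delay_ms = first_delay_ms;
    int i;

    assert(attempt != NULL && max_attempts > 0);

    for (i = 1; i <= max_attempts; i++) {
        enum attempt_result r = attempt(ctx);

        if (r == ATTEMPT_CONNECTED)
            return i;
        if (r != ATTEMPT_REJECTED)
            return -1;

        /* Rejected: give the server time to start listening, then retry. */
        if (i < max_attempts && wait != NULL) {
            wait(delay_ms, ctx);
            delay_ms *= 2;
        }
    }
    return 0;
}

/* Stand-in peer for experimenting: rejects a fixed number of attempts. */
struct fake_peer {
    int  rejects_left;  /* attempts that arrive before the "server" listens */
    long waited_ms;     /* total delay requested so far */
};

static enum attempt_result fake_attempt(void *ctx)
{
    struct fake_peer *p = ctx;

    if (p->rejects_left > 0) {
        p->rejects_left--;
        return ATTEMPT_REJECTED;
    }
    return ATTEMPT_CONNECTED;
}

/* Records the requested delay instead of actually sleeping. */
static void fake_wait(long delay_ms, void *ctx)
{
    struct fake_peer *p = ctx;

    p->waited_ms += delay_ms;
}
```

The delay is passed in as a callback so the sketch stays portable; a real
client would sleep there.  Note that retrying only hides a startup race.  It
will not help if the reject has some other cause.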

>It works on the first card (ib0) but fails on the second card (ib1).

Please take a look at the following thread:

http://lists.openfabrics.org/pipermail/general/2007-May/036559.html

In particular, see Steve's message about this:

http://lists.openfabrics.org/pipermail/general/2007-May/036571.html

and let me know if his suggestion fixes your problem.

I will update the librdmacm documentation with this information as well.

- Sean


