[ofa-general] [GIT PULL] OFED 1.2 uDAPL release notes
    Sean Hefty 
    sean.hefty at intel.com
       
    Fri Jul  6 09:48:02 PDT 2007
    
    
  
>Eventhough I force all ranks only using the first card (ib0), it works
>for a while and
>then fails with NON_PEER_REJECTED when one rank tries to connect to
>another rank (dat_connect()
>and dat_evd_wait()). (I run a simple MPI job in an infinite loop, it
>fails after hundreds runs);
This sounds like it could be a race condition as a result of running the test in
a loop.  If the client starts before the server is listening, it will receive
this sort of reject event.
>It works on the first card (ib0), failed on the second card (ib1)
Please take a look at the following thread:
http://lists.openfabrics.org/pipermail/general/2007-May/036559.html
In particular, see Steve's message about this:
http://lists.openfabrics.org/pipermail/general/2007-May/036571.html
and let me know if his suggestion fixes your problem.
I will update the librdmacm documentation with this information as well.
- Sean
    
    
More information about the general
mailing list