[libfabric-users] An approach on Two Multi-process Applications Communication using RDM and RMA.

Hefty, Sean sean.hefty at intel.com
Mon May 11 09:48:43 PDT 2020


> I am having trouble creating communication between two applications. One is MPI-focused
> and the other one is process-focused, when I get to the initialization of the AV, the
> other side responds with an error 61, but the examples from fabtests seem to work (at
> least fi_rdm_rma_simple). What would be the best approach? I am using active endpoints,
> but it seems it is not the right technique. What are some of the main considerations I
> must keep while doing this?

There's not a lot of details on what's happening.  You need to ensure that all sides follow the same protocol for inserting addresses into the AV.  Fabtests does this with itself, which is why it works.  Fabtests carry the address in the first packet sent from a client to the server.  Many MPI apps rely on an out of band setup to initialize their AVs.  That's definitely easier, but requires that the nodes already have some level of communication between them, usually done through a process manager.

- Sean


More information about the Libfabric-users mailing list