[ewg] bug 1918 - openmpi broken due to rdma-cm changes

Sean Hefty sean.hefty at intel.com
Thu Feb 4 14:33:04 PST 2010


>OpenMPI uses rdma_bind_addr() to figure out which IP addresses are valid
>for which IB devices.   This logic is now broken.  Regardless of whether
>OpenMPI should use another method for determining which IP addresses
>belong to which interfaces, we should probably rethink whether we're
>breaking rdma-cm semantics in a bad way on a point release.

The changes to the rdma_cm have been merged upstream.  These were fixes
specifically to enable using the loopback address with RDMA devices.

As a first thought, we could extend enum ib_device_cap_flags to indicate whether
a device supports loopback.  The rdma_cm could then skip over devices that lack
that capability when dealing with a loopback address.
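
A rough sketch of that idea, as standalone C rather than actual kernel code.
Everything prefixed demo_/DEMO_ is a hypothetical stand-in: DEMO_DEVICE_LOOPBACK
is an assumed new capability bit (no such flag exists in enum
ib_device_cap_flags today), and struct demo_device only mimics the one field
that matters here.

```c
#include <stddef.h>

/* Hypothetical stand-in for enum ib_device_cap_flags.
 * DEMO_DEVICE_LOOPBACK is the assumed new bit, not an existing flag. */
enum demo_device_cap_flags {
    DEMO_DEVICE_RESIZE_MAX_WR = 1 << 0,
    DEMO_DEVICE_LOOPBACK      = 1 << 1,  /* assumed: device can do loopback */
};

/* Hypothetical, minimal model of an RDMA device entry. */
struct demo_device {
    const char  *name;
    unsigned int cap_flags;
};

/* Walk the device list and return the first device that advertises
 * loopback support, skipping all others.  Returns NULL if none does. */
static const struct demo_device *
demo_pick_loopback_device(const struct demo_device *devs, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (!(devs[i].cap_flags & DEMO_DEVICE_LOOPBACK))
            continue;  /* no loopback capability: skip this device */
        return &devs[i];
    }
    return NULL;
}
```

With a list holding one device without the bit followed by one with it, the
lookup would return the second device; with no capable device it returns NULL,
and the caller would then fail the bind to the loopback address.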

- Sean



