[ewg] bug 1918 - openmpi broken due to rdma-cm changes

Steve Wise swise at opengridcomputing.com
Thu Feb 4 14:39:23 PST 2010


Sean Hefty wrote:
>> OpenMPI uses rdma_bind_addr() to figure out which IP addresses are valid
>> for which IB devices. This logic is now broken. Regardless of whether
>> OpenMPI should use another method for determining which IP addresses
>> belong to which interfaces, we should probably rethink whether we're
>> breaking rdma-cm semantics in a bad way on a point release.
>>     
>
> The changes to the rdma_cm have been merged upstream.  These were fixes
> specifically to enable using the loopback address with RDMA devices.
>
> At first thought, we can extend enum ib_device_cap_flags to indicate if a device
> supports loopback capabilities or not.  The rdma_cm could then skip over such
> devices when dealing with a loopback address. 
>
> - Sean
>   

But how can you determine _which_ rdma device should be used if an app 
binds to 127.0.0.1?  I think this is busted...
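
To make the ambiguity concrete, here is a toy model in C. It is not rdma_cm
code and does not call librdmacm: struct toy_rdma_dev, toy_is_loopback(),
toy_bind_candidates() and the supports_loopback field are all hypothetical
names made up for illustration, with supports_loopback standing in for the
kind of capability flag suggested above. It only sketches the selection
problem, assuming each device has one IPv4 address.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#include <arpa/inet.h>

/* Hypothetical model of one RDMA device; not a librdmacm or kernel type. */
struct toy_rdma_dev {
    const char *name;
    const char *ipv4;       /* address configured on the device's netdev */
    int supports_loopback;  /* stand-in for a loopback capability flag */
};

/* Return nonzero if addr parses as IPv4 and lies in 127.0.0.0/8. */
static int toy_is_loopback(const char *addr)
{
    struct in_addr in;

    if (inet_pton(AF_INET, addr, &in) != 1)
        return 0;
    return (ntohl(in.s_addr) >> 24) == 127;
}

/*
 * Find every device that could satisfy a bind to addr.
 * A normal address matches only the device that owns it.
 * A loopback address matches every loopback-capable device.
 * Writes at most max indices into out and returns the total match count.
 */
static size_t toy_bind_candidates(const struct toy_rdma_dev *devs,
                                  size_t ndevs, const char *addr,
                                  size_t *out, size_t max)
{
    size_t n = 0;
    int loopback;

    assert(devs != NULL && addr != NULL && out != NULL);
    loopback = toy_is_loopback(addr);

    for (size_t i = 0; i < ndevs; i++) {
        int match = loopback ? devs[i].supports_loopback
                             : strcmp(devs[i].ipv4, addr) == 0;
        if (match) {
            if (n < max)
                out[n] = i;
            n++;
        }
    }
    return n;
}
```

In this model, a bind to a device's own address yields exactly one candidate,
so probing per address works. A bind to 127.0.0.1 yields one candidate per
loopback-capable device, so on a host with two such devices the flag narrows
the list but still does not say which device to pick.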




