[ofiwg] [libfabric-users] intel mpi with libfabric

Ilango, Arun arun.ilango at intel.com
Wed Nov 21 10:42:20 PST 2018


Mohammed,

Just to add what Dmitry said, if you're using your own libfabric, please make sure it's the latest (i.e. v1.6.2). You can check the version by running fi_info --version.

Other things to check:
1. Make sure you have librdmacm package installed.
2. Check if the IPoIB interface of the node has been configured with an IP address and is pingable from other nodes in the cluster.

Thanks,
Arun.

-----Original Message-----
From: Gladkov, Dmitry 
Sent: Wednesday, November 21, 2018 10:31 AM
To: Hefty, Sean <sean.hefty at intel.com>; Mohammed Shaheen <m_shaheen1984 at yahoo.com>; libfabric-users at lists.openfabrics.org; ofiwg at lists.openfabrics.org
Cc: Ilango, Arun <arun.ilango at intel.com>
Subject: RE: [libfabric-users] intel mpi with libfabric

Hi Mohammed,

Do you use your own version of libfabirc?

IMPI 2019 U1 uses its internal libfabric by default.
If you use your libfabric, please, specify LD_LIBRABRY_PATH to your library and FI_PROVIDER_PATH to path to OFI DL providers (<ofi_install_dir>/lib/libfabric) if you use DL provider, or unset this variable (mpivars.sh sets it).

--
Dmitry

-----Original Message-----
From: Hefty, Sean
Sent: Wednesday, November 21, 2018 8:52 PM
To: Mohammed Shaheen <m_shaheen1984 at yahoo.com>; libfabric-users at lists.openfabrics.org; ofiwg at lists.openfabrics.org
Cc: Ilango, Arun <arun.ilango at intel.com>; Gladkov, Dmitry <dmitry.gladkov at intel.com>
Subject: RE: [libfabric-users] intel mpi with libfabric

Copying ofiwg and key developers for this issue.

- Sean

> I get the following error running a small mpi test program using intel 
> mpi 2019 from intel parallel studio cluster edition update 1 (the
> newest) on Mellanox FDR Cluster:
> 
> 
> 
> test.e: prov/verbs/src/ep_rdm/verbs_rdm_cm.c:337:
> fi_ibv_rdm_process_addr_resolved: Assertion `id->verbs == ep->domain-
> >verbs' failed.
> 
> 
> 
> The program hangs on this error message. I installed the newest 
> release of libfabric and configured it with only ibverbs support. I 
> used the inbox (sles 11 sp4 and sles 12 sp3) ibverbs and rdma 
> libraries. I also tried with mellanox ofed to no avail.
> 
> 
> 
> 
> Any ideas how to go about it?
> 
> 
> 
> 
> 
> Regards,
> 
> Mohammed



More information about the ofiwg mailing list