[openib-general] mvapich2 pmi scalability problems

Matthew Koop koop at cse.ohio-state.edu
Fri Jul 21 15:26:27 PDT 2006


> Since we are compiling for ia64 our assumption is it compiled with
> HAVE_MPD_RING="-DUSE_MPD_RING". Is this correct?
> Also, we are not using mpd to run start the jobs. Since we are
> using slurm as the resource manager the jobs are started with
> srun. Does MPD_RING on apply if using MDP?

It should be using the USE_MPD_RING flag in that case. Just to make sure,
can you just verify by seeing the compile flags used by `grep`ing mpicc:

grep MPI_CFLAGS mpicc

it should print out -DUSE_MPD_RING. Even though you are using SLURM this
option should still work since MPD is not actually used -- it is making
use of the PMI interface only.

The numbers of messages you posted earlier would be consistent with what
would be expected due to IB QP information that is exchanged at startup.
The preferred setup, which should show superior scalability is using
-DUSE_MPD_RING. Even in the all PMI case we should be able to add some
additional optimizations, but the ring startup is really what should be
used.

Matt






More information about the general mailing list