[openib-general] Announcing the release of MVAPICH2 0.9.5 with SRQ, integrated multi-rail and TotalView support

Dhabaleswar Panda panda at cse.ohio-state.edu
Wed Aug 30 20:58:38 PDT 2006


The MVAPICH team is pleased to announce the availability of MVAPICH2
0.9.5 with the following NEW features:

 - Shared Receive Queue (SRQ) and Adaptive RDMA support: These
   features reduce memory usage of the MPI library significantly to
   provide scalability without any degradation in performance.

   Performance of applications and memory scalability using SRQ
   and Adaptive RDMA support can be seen by visiting the following
   URL:

   http://nowlab.cse.ohio-state.edu/projects/mpi-iba/perf-apps.html 

 - Integrated multi-rail communication support for both two-sided and 
   one-sided operations 
     - Multiple queue pairs per port
     - Multiple ports per adapter
     - Multiple adapters
    
 - Support for TotalView debugger 

 - Auto-detection of Architecture and InfiniBand adapters 

More details on all features and supported platforms can be obtained
by visiting the following URL:

http://nowlab.cse.ohio-state.edu/projects/mpi-iba/mvapich2_features.html

MVAPICH2 0.9.5 continues to deliver excellent performance.  Sample
performance numbers include:
 
  - OpenIB/Gen2 on EM64T with PCI-Ex and IBA-DDR:
      Two-sided operations: 
        - 2.97 microsec one-way latency (4 bytes)
        - 1478 MB/sec unidirectional bandwidth 
        - 2658 MB/sec bidirectional bandwidth  

      One-sided operations:
        - 5.08 microsec Put latency 
        - 1484 MB/sec unidirectional Put bandwidth 
        - 2658 MB/sec bidirectional Put bandwidth 

  - OpenIB/Gen2 on EM64T with PCI-Ex and IBA-DDR (Dual-rail):
      Two-sided operations:
        - 3.01 microsec one-way latency (4 bytes)
        - 2346 MB/sec unidirectional bandwidth
        - 2779 MB/sec bidirectional bandwidth

      One-sided operations:
        - 4.70 microsec Put latency
        - 2389 MB/sec unidirectional Put bandwidth
        - 2779 MB/sec bidirectional Put bandwidth

  - OpenIB/Gen2 on Opteron with PCI-Ex and IBA-DDR: 
      Two-sided operations:
        - 2.71 microsec one-way latency (4 bytes) 
        - 1411 MB/sec unidirectional bandwidth
        - 2238 MB/sec bidirectional bandwidth 

      One-sided operations:
        - 4.28 microsec Put latency
        - 1411 MB/sec unidirectional Put bandwidth
        - 2238 MB/sec bidirectional Put bandwidth

  - Solaris uDAPL/IBTL on Opteron with PCI-Ex and IBA-SDR:
      Two-sided operations:
        - 4.81 microsec one-way latency (4 bytes)
        - 981 MB/sec unidirectional bandwidth
        - 1903 MB/sec bidirectional bandwidth

      One-sided operations:
        - 7.49 microsec Put latency
        - 981 MB/sec unidirectional Put bandwidth
        - 1903 MB/sec bidirectional Put bandwidth

  - OpenIB/Gen2 uDAPL on EM64T with PCI-Ex and IBA-SDR:
      Two-sided operations:
        - 3.56 microsec one-way latency (4 bytes)
        - 964 MB/sec unidirectional bandwidth
        - 1846 MB/sec bidirectional bandwidth

      One-sided operations:
        - 6.85 microsec Put latency
        - 964 MB/sec unidirectional Put bandwidth
        - 1846 MB/sec bidirectional Put bandwidth

  - OpenIB/Gen2 uDAPL on EM64T with PCI-Ex and IBA-DDR:
      Two-sided operations:
        - 3.18 microsec one-way latency (4 bytes)
        - 1484 MB/sec unidirectional bandwidth
        - 2635 MB/sec bidirectional bandwidth

      One-sided operations:
        - 5.41 microsec Put latency
        - 1485 MB/sec unidirectional Put bandwidth
        - 2635 MB/sec bidirectional Put bandwidth

Performance numbers for all other platforms, system configurations and
operations can be viewed by visiting `Performance' section of the
project's web page.

With the ADI-3-level design, MVAPICH2 0.9.5 delivers similar
performance for two-sided operations compared to MVAPICH 0.9.8.
Performance comparison between MVAPICH2 0.9.5 and MVAPICH 0.9.8 for
sample applications can be seen by visiting the following URL:

  http://nowlab.cse.ohio-state.edu/projects/mpi-iba/perf-apps.html 

Organizations and users interested in getting the best performance for
both two-sided and one-sided operations and also want to exploit
`multi-threading' and `integrated multi-rail' capabilities may migrate
from MVAPICH code base to MVAPICH2 code base.

For downloading MVAPICH2 0.9.5 package and accessing the anonymous
SVN, please visit the following URL:

http://nowlab.cse.ohio-state.edu/projects/mpi-iba/

A stripped down version of this release is also available at the
OpenIB SVN.

All feedbacks, including bug reports and hints for performance tuning,
are welcome. Please post it to the mvapich-discuss mailing list.

Thanks, 

MVAPICH Team at OSU/NBCL 

======================================================================
MVAPICH/MVAPICH2 project is currently supported with funding from
U.S. National Science Foundation, U.S. DOE Office of Science,
Mellanox, Intel, Cisco Systems, Sun Microsystems and Linux Networx;
and with equipment support from Advanced Clustering, AMD, Apple,
Appro, Dell, IBM, Intel, Mellanox, Microway, PathScale, SilverStorm
and Sun Microsystems. Other technology partner includes Etnus.
======================================================================





More information about the general mailing list