NBCBench - benchmarking Nonblocking MPI Collective Operations Performance

Description

NBCBench is a benchmark that measures overlap and asynchronous progression of nonblocking collective operations implemented in LibNBC. NBCBench is distributed under the BSD license.

Download NBCBench

Performance Results for different MPI Implementations

We present performance results of LibNBC for different MPI implementations. LibNBC issues MPI_Isend() and MPI_Irecv() calls, and the performance and possible overlap depends on the implementation in MPI. We do also compare the collective operations implemented in LibNBC to the MPI operations. Results are available for the following MPI Implementations: Please keep in mind that not all collective algorithms in LibNBC are optimized!

Benchmark Methodology

We used the overlap-benchmark which has been designed to assess the maximal possible overlap and the minimal latencies. The benchmark will be described later. Details can be found in [1] and [2].

References

PMEO'08
[1] T. Hoefler, T. Schneider, A. Lumsdaine:
 Accurately Measuring Collective Operations at Massive Scale In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, PMEO'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008, Invited to a journal special issue on top picks from PMEO'08.
SC07
[2] T. Hoefler, A. Lumsdaine, W. Rehm:
 Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI In Proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC07, presented in Reno, USA, IEEE Computer Society/ACM, Nov. 2007, (acceptance rate 20%, 54/268)