Abstract
The NAS Parallel Benchmarks (NPB) are well-known applications with the fixed algorithms for evaluating parallel systems and tools. Multicore supercomputers provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node and MPI can be used with the communication between nodes. In this paper, we use SP and BT benchmarks of MPI NPB 3.3 as a basis for a comparative approach to implement hybrid MPI/OpenMP versions of SP and BT. In particular, we can compare the performance of the hybrid SP and BT with the MPI counterparts on large-scale multicore supercomputers. Our performance results indicate that the hybrid SP outperforms the MPI SP by up to 20.76%, and the hybrid BT outperforms the MPI BT by up to 8.58% on up to 10,000 cores on BlueGene/P at Argonne National Laboratory and Jaguar (Cray XT4/5) at Oak Ridge National Laboratory. We also use performance tools and MPI trace libraries available on these supercomputers to further investigate the performance characteristics of the hybrid SP and BT.
- Argonne Leadership Computing Facility BlueGene/P (Intrepid), Argonne National Laboratory, http://www.alcf.anl.gov/resources.Google Scholar
- D. Bailey, E. Barszcz, et al., The NAS Parallel Benchmarks, Tech. Report RNR-94-007, 1994.Google Scholar
- F. Cappello and D. Etiemble, MPI versus MPI+OpenMP on the IBM SP for the NAS Benchmarks, SC2000. Google Scholar
Digital Library
- Cray Performance analysis toolkit (CrayPat), http://www.nccs.gov/computing-resources/jaguar/software/?software=craypat. Also see Using Cray Performance Analysis Tools, Cray Doc S-2376-41, 2007.Google Scholar
- H. Jin, M. Frumkin and J. Yan, The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance, NAS Technical Report NAS-99-011, October 1999.Google Scholar
- G. Jost, H. Jin, D. Mey, and F. Hatay, Comparing the OpenMP, MPI, and Hybrid Programming Paradigms on an SMP Cluster, the Fifth European Workshop on OpenMP (EWOMP03), Sep. 2003.Google Scholar
- H. Jin and R. Van der Wijingaart, Performance Characteristics of the Multi-Zone NAS Parallel Benchmarks, IPDPS'04, 2004.Google Scholar
- G. Lakner, I. Chung, G. Cong, S. Fadden, N. Goracke, D. Klepacki, J. Lien, C. Pospiech, S. R. Seelam, and H. Wen, IBM System Blue Gene Solution: Performance Analysis Tools, Redbook, REDP-4256-01, November 2008.Google Scholar
- HPCT MPI Profiling and Tracing Library, https://wiki.alcf.anl.gov/index.php/HPCT_MPITRACE.Google Scholar
- NAS Parallel Benchmarks 3.3, http://www.nas.nasa.gov/Resources/Software/npb.html.Google Scholar
- NCCS Jaguar and JaguarPF, Oak Ridge National Laboratory,http://www.nccs.gov/computing-resources/jaguar/Google Scholar
- V. Salapura, K. Ganesan, A. Gara, M. Gschwind, J. Sexton, and R. Walkup, Next-Generation Performance Counters: Towards Monitoring over Thousand Concurrent Events, IBM Research Report, RC24351 (W0709-061), September 19, 2007.Google Scholar
- Universal Performance Counter (UPC) Unit and HPM library for BG/P, https://wiki.alcf.anl.gov/index.php/PerformanceGoogle Scholar
- R. Van der Wijngaart and H. Jin, NAS Parallel Benchmarks, Multi-Zone Versions, NAS Technical Report NAS-03-010, July 2003.Google Scholar
Recommendations
Performance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters
The NAS Parallel Benchmarks (NPB) are well-known applications with fixed algorithms for evaluating parallel systems and tools. Multicore clusters provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data ...
Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers
In this paper, we present a performance modeling framework based on memory bandwidth contention time and a parameterized communication model to predict the performance of OpenMP, MPI and hybrid applications with weak scaling on three large-scale ...
Tools-supported HPF and MPI parallelization of the NAS parallel benchmarks
FRONTIERS '96: Proceedings of the 6th Symposium on the Frontiers of Massively Parallel ComputationHigh Performance Fortran (HPF) compilers and communication libraries with the standardized Message Passing Interface (MPI) are becoming widely available, easing the development of portable parallel applications. The Annai tool environment supports ...






Comments