Abstract
In this paper, we evaluate the performance and usability of the parallel programming model OpenMP Superscalar (OmpSs), apply it to 10 different benchmarks and compare its performance with corresponding POSIX threads implementations.
- C. C. Chi and B. Juurlink. A QHD-Capable Parallel H.264 Decoder. In Proc. 25th Int. Conf. on Supercomputing, 2011. Google Scholar
Digital Library
- A. Duran, X. Teruel, R. Ferrer, X. Martorell, and E. Ayguadé. Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP. In Proc. Int. Conf. on Parallel Processing, 2009. Google Scholar
Digital Library
- J. M. Perez, R. M. Badia, and J. Labarta. A Flexible and Portable Programming Model for SMP and Multi-cores. Technical report, BSC-UPC, 2007.Google Scholar
- A. Pop and A. Cohen. A Stream-Computing Extension to OpenMP. In Proc. 6th Int. Conf. on High Performance and Embedded Architectures and Compilers, 2011. Google Scholar
Digital Library
- H. Vandierendonck, P. Pratikakis, and D. Nikolopoulos. Parallel Programming of General-Purpose Programs Using Task-Based Programming Models. In Proc. 3rd USENIX Workshop on Hot Topics in Parallelism, 2011. Google Scholar
Digital Library
Index Terms
Programming parallel embedded and consumer applications in OpenMP superscalar
Recommendations
Programming parallel embedded and consumer applications in OpenMP superscalar
PPoPP '12: Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel ProgrammingIn this paper, we evaluate the performance and usability of the parallel programming model OpenMP Superscalar (OmpSs), apply it to 10 different benchmarks and compare its performance with corresponding POSIX threads implementations.
Evaluation of Multicore Processors for Embedded Systems by Parallel Benchmark Program Using OpenMP
IWOMP '09: Proceedings of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme ParallelismRecently, multicore technology has been introduced to embedded systems in order to improve performance and reduce power consumption. In the present study, three SMP multicore processors for embedded systems and a multicore processor for a desktop PC are ...
Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models
Directive-based programming models, such as OpenMP, OpenACC, and OmpSs, enable users to accelerate applications by using coprocessors with little effort. These devices offer significant computing power, but their use can introduce two problems: an ...







Comments