skip to main content
poster

OpenMP-style parallelism in data-centered multicore computing with R

Published:25 February 2012Publication History
Skip Abstract Section

Abstract

R1 is a domain specific language widely used for data analysis by the statistics community as well as by researchers in finance, biology, social sciences, and many other disciplines. As R programs are linked to input data, the exponential growth of available data makes high-performance computing with R imperative. To ease the process of writing parallel programs in R, code transformation from a sequential program to a parallel version would bring much convenience to R users. In this paper, we present our work in semi-automatic parallelization of R codes with user-added OpenMP-style pragmas. While such pragmas are used at the frontend, we take advantage of multiple parallel backends with different R packages. We provide flexibility for importing parallelism with plug-in components, impose built-in MapReduce for data processing, and also maintain code reusability. We illustrate the advantage of the on-the-fly mechanisms which can lead to significant applications in data-centered parallel computing.

References

  1. H. Chafi et al., A domain-specific approach to heterogeneous parallelism. Proc. of PPoPP'2011, Feb 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Dean and S. Ghemawat, MapReduce: simplified data processing on large clusters. Proc. of OSDI'2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. G. Dotzler, R. Veldema and M. Klemm, JCudaMP: OpenMP/Java on CUDA. Proc. of the 3rd Int. Workshop on Multicore Software Engineering (IWMSE'2010), May 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Eddelbuettel and R. Francois, Rcpp: seamless R and C++ integration. Journal of Statistical Software, vol. 40, iss. 8, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  5. M. Feng, R. Gupta and Y. Hu, SpiceC: scalable parallelism via implicit copying and explicit commit. Proc. of PPoPP'2011, Feb 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Schmidberger et al., State of the art in parallel computing with R. Journal of Statistical Software, vol. 31, iss.1, 2009.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. OpenMP-style parallelism in data-centered multicore computing with R

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!