Simple Parallel Statistical Computing in R
- 1 June 2007
- journal article
- Published by Taylor & Francis in Journal of Computational and Graphical Statistics
- Vol. 16 (2) , 399-420
- https://doi.org/10.1198/106186007x178979
Abstract
Theoretically, many modern statistical procedures are trivial to parallelize. However, practical deployment of a parallelized implementation which is robust and reliably runs on different computational cluster configurations and environments is far from trivial. We present a framework for the R statistical computing language that provides a simple yet powerful programming interface to a computational cluster of CPUs. This interface allows the rapid development of R functions that distribute independent computations across the nodes of the computational cluster. The approach can be extended to finer grain parallelization if needed. The resulting framework allows statisticians to obtain significant speed-ups for some computations at little additional development cost. The particular implementation can be deployed in ad-hoc heterogeneous computing environments.Keywords
This publication has 9 references indexed in Scilit:
- An Object-Oriented Random-Number Package with Many Long Streams and SubstreamsOperations Research, 2002
- Asynchronous Parallel Pattern Search for Nonlinear OptimizationSIAM Journal on Scientific Computing, 2001
- Lexical Scope and Statistical ComputingJournal of Computational and Graphical Statistics, 2000
- The MOSIX multicomputer operating system for high performance cluster computingFuture Generation Computer Systems, 1998
- R: A Language for Data Analysis and GraphicsJournal of Computational and Graphical Statistics, 1996
- Annealing Markov Chain Monte Carlo with Applications to Ancestral InferenceJournal of the American Statistical Association, 1995
- Integrating Parallelization Strategies for Linkage AnalysisComputers and Biomedical Research, 1995
- Parallelization of General-Linkage Analysis ProblemsHuman Heredity, 1994
- Applications of Parallel Computation to Statistical InferenceJournal of the American Statistical Association, 1988