[R] Distributed computing with R

Roger D. Peng rpeng at jhsph.edu
Thu Jun 3 04:56:25 CEST 2004


I would suggest installing PVM or LAM-MPI and using the R
package `snow' together with `rpvm' (for PVM) or `Rmpi' (for
LAM-MPI).  I've found the `snow' package very simple to use and
useful for quick and dirty solutions.  I've used `snow' with an
openMosix setup and on a simple cluster of workstations without
any scheduler.  openMosix is nice because you don't have to worry
about which process goes where, but that's not to say it doesn't
have its own difficulties.  Overall, my experience with parallel
computing in R has been a little clunky, but that's mostly
because the problems I work on don't benefit much from such a
setup.
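
To give a rough idea of what a quick `snow' job can look like,
here is a minimal sketch.  It assumes a socket cluster of two
local workers (so it does not need PVM or LAM-MPI to try out),
and the data and the choice of mean() are made up purely for
illustration.

```r
library(snow)

# Assumption: two local worker processes connected by sockets, so the
# sketch runs without PVM or LAM-MPI.  On a real cluster one would pass
# type = "PVM" or type = "MPI" (needing `rpvm' or `Rmpi') instead.
cl <- makeCluster(2, type = "SOCK")

# Hypothetical data: three small numeric vectors standing in for
# independent chunks of work.
chunks <- list(c(1, 2, 3), c(4, 5, 6), c(7, 8, 9))

# parLapply() splits the list across the workers and returns the
# results in the original order.
res <- parLapply(cl, chunks, mean)

# Always shut the workers down when finished.
stopCluster(cl)

print(unlist(res))
```

Whether this pays off depends on the problem: each chunk has to
be expensive enough to outweigh the cost of shipping data to the
workers.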

-roger

Saroj Mohapatra wrote:
> Dear all,
> 
> We started using R for data analysis a few months ago and find it
> useful. We are planning to acquire a high-end dedicated system for
> microarray data analysis and are thinking of a distributed environment.
> I would appreciate it if someone could send some pointers on how to
> choose a proper hardware configuration, software (R or other software,
> esp. MATLAB), issues in setting up the cluster, etc. Does anyone here
> have experience with R on a cluster? Does it provide significant
> benefits in processing time? Is setting up the cluster more difficult
> than using R on it?
> 
> Thanks.
> 
> Saroj K Mohapatra, MD
> Research Associate
> Tainsky Lab
> Karmanos Cancer Institute
> Wayne State University School of Medicine
> 110 E. Warren, Room 311
> Detroit MI 48201
> 313-833-0715 x2424
> saroj at wayne.edu