[R-SIG-Finance] Running R as a server or in a cluster
Joshua Reich
josh at gghc.com
Wed Sep 26 16:26:58 CEST 2007
Yes - its JBOC (just a bunch of computers). You provide them with a disk
image (of sorts) and they will load it on to as many computers as you
request. Images are loaded and machines are requested via a web services
API. Initially you can request up to 20 machines - but if you email them
you can ask for more. All network bandwidth between machines is free,
but there is a per GB transfer charge for external connectivity - I
can't recall what the rate is, but it is very reasonable.
Not being a specialized grid environment, all inter-node communication
and scheduling has to be handled by your own application. But for the
price, that's not too bad.
While I was aware of SNOW, I'm not familiar with the other clustering
approaches mentioned earlier in this thread. What special sauce does Sun
provide to make running on a grid easier than running on a JBOC style
setup?
Josh
-----Original Message-----
From: Jeff Ryan [mailto:jeff.a.ryan at gmail.com]
Sent: Wednesday, September 26, 2007 10:18 AM
To: Joshua Reich
Cc: Brian G. Peterson; r-sig-finance at stat.math.ethz.ch
Subject: Re: [R-SIG-Finance] Running R as a server or in a cluster
I do know the Sun one is using their grid software, and is supposedly
highly secure. Basically have access to a 2000 node opteron cluster.
The Amazon one seems to be more of using a machine, one at a time. Is
that correct?
On 9/26/07, Joshua Reich <josh at gghc.com> wrote:
> We recently set up a similar environment using Amazon's EC2 service.
> They charge $0.1 per CPU hour. I can't say what our results have been
> like yet - still ironing out the kinks in our R code. But I will
> certainly let you all know how it goes.
>
> Our 'clustering' mechanism is very simple. We have written perl
> scripts that receive data over HTTP, start R, process the data, and
> then post the results back via HTTP to a central server.
>
> Josh
>
> -----Original Message-----
> From: r-sig-finance-bounces at stat.math.ethz.ch
> [mailto:r-sig-finance-bounces at stat.math.ethz.ch] On Behalf Of Jeff
> Ryan
> Sent: Wednesday, September 26, 2007 10:08 AM
> To: Brian G. Peterson
> Cc: r-sig-finance at stat.math.ethz.ch
> Subject: Re: [R-SIG-Finance] Running R as a server or in a cluster
>
> Hi all,
>
> Short of answers, but I do wonder if anyone has used Sun Microsystems
> www.network.com for grid work with R. At 1USD a CPU hr, with R
> already built - and a working example script on the service - it seems
> like a path worth exploring.
>
> Has anyone given it a try. I set up an account, but have yet to get
> the opportunity to try it out.
>
> Here is the link:
>
> http://www.network.com/apps/r_project.html
>
> Jeff Ryan
>
> On 9/26/07, Brian G. Peterson <brian at braverock.com> wrote:
> > Adrian Dragulescu wrote:
> > > We have set up a Condor cluster, see
> > > http://www.cs.wisc.edu/condor/ and we submit R jobs to the
> > > cluster. It works well because Condor has very advanced
scheduling capabilities, job monitoring, etc.
> >
> > Adrian,
> >
> > Could you provide more details? Are you running Rserve on the
> > cluster, running "R CMD BATCH", or using Parallel-R?
> >
> > I'd like to suggest that we use this thread to continue to develop
> > the
>
> > collective knowledge of the r-sig-finance community on distributed
> > or high-throughput R calculations.
> >
> > Regards,
> >
> > - Brian
> >
> > _______________________________________________
> > R-SIG-Finance at stat.math.ethz.ch mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-sig-finance
> > -- Subscriber-posting only.
> > -- If you want to post, subscribe first.
> >
>
> _______________________________________________
> R-SIG-Finance at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-finance
> -- Subscriber-posting only.
> -- If you want to post, subscribe first.
>
More information about the R-SIG-Finance
mailing list