[R] Very Large Data Sets

Bill Venables William.Venables at cmis.CSIRO.AU
Thu Dec 23 07:05:23 CET 1999

Tony Fagan asks:

> List,


> Can R handle very large data sets (say, 100 million records) for data 
> mining applications? 

The question assumes that the data handling capacity is a
property of the software alone, which is nonsense.  It is partly
a property of the software, partly of what you want to do with
the records, but mostly of the system on which it is run.

> My understanding is that Splus can not, but SAS can easily.

Try handling 100 million records with SAS (or anything else) on a
486 and see how easily it does it.

More seriously, the consensus is that on the same modern system
SAS is usually better able to handle large, dumb calculations
than S-PLUS, which is (generally) better than R.  Horses for

Bill Venables.
Bill Venables, Statistician, CMIS Environmetrics Project.

Physical address:                            Postal address:
CSIRO Marine Laboratories,                   PO Box 120,       
233 Middle St, Cleveland, Queensland         Cleveland, Qld, 4163
AUSTRALIA                                    AUSTRALIA

Telephone: +61 7 3826 7251     Email: Bill.Venables at cmis.csiro.au

      Fax: +61 7 3826 7304

r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch

More information about the R-help mailing list