[R] "Large" data set: performance issue

Peter Dalgaard BSA p.dalgaard at biostat.ku.dk
Tue Apr 2 16:16:41 CEST 2002


Till Baumgaertel <till.baumgaertel at epost.de> writes:

> hi all,
> 
> I've got to import CSV-datasets (with variable-names in the first line)
> into data.frames. each is about 12MB (or more!) with 1823 columns and about
> 500 rows. the first 22 columns are in "character"-mode, the rest is "numeric".
> 
> I run R 1.4.1 on a Windows 2000 system.

What happens if you try this?:

datfull <- read.csv("foo", colClasses=rep(c("character","numeric"),c(22,1801)))


-- 
   O__  ---- Peter Dalgaard             Blegdamsvej 3  
  c/ /'_ --- Dept. of Biostatistics     2200 Cph. N   
 (*) \(*) -- University of Copenhagen   Denmark      Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk)             FAX: (+45) 35327907
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._



More information about the R-help mailing list