[Rd] R's IO speed
Martin Maechler
maechler at stat.math.ethz.ch
Sun Dec 26 12:34:12 CET 2004
>>>>> "BDR" == Prof Brian Ripley <ripley at stats.ox.ac.uk>
>>>>> on Sun, 26 Dec 2004 10:03:30 +0000 (GMT) writes:
BDR> R-devel now has some improved versions of read.table
BDR> and write.table. For a million-row data frame
BDR> containing one number, one factor with few levels and
BDR> one logical column, a 56Mb object.
BDR> generating it takes 4.5 secs.
BDR> calling summary() on it takes 2.2 secs.
BDR> writing it takes 8 secs and an additional 10Mb.
BDR> saving it in .rda format takes 4 secs.
BDR> reading it naively takes 28 secs and an additional
BDR> 240Mb
BDR> reading it carefully (using nrows, colClasses and
BDR> comment.char) takes 16 secs and an additional 150Mb
BDR> (56Mb of which is for the object read in). (The
BDR> overhead of read.table over scan was about 2 secs,
BDR> mainly in the conversion back to a factor.)
BDR> loading from .rda format takes 3.4 secs.
BDR> [R 2.0.1 read in 23 secs using an additional 210Mb, and
BDR> wrote in 50 secs using an additional 450Mb.]
Excellent!
Thanks a lot Brian (for this and much more)!
I wish you continued merry holidays!
Martin
More information about the R-devel
mailing list