[R-pkgs] LaF 0.3: fast access to large ASCII files
Jan van der Laan
djvanderlaan at unrealizedtime.nl
Sun Nov 13 13:42:36 CET 2011
The LaF package provides methods for fast access to large ASCII files.
Currently the following file formats are supported:
* comma separated format (csv) and other separated formats and
* fixed width format.
It is assumed that the files are too large to fit into memory, although
the package can also be used to efficiently access files that do fit
into memory.
In order to process files that are too large to fit into memory, methods
are provided to access and process file blockwise. Furthermore, an
opened file can be indexed as one would a data.frame. In this way
subsets. or specific columns can be read into memory. For example,
assuming that an object laf has been created using one of the functions
laf_open_csv or laf_open_fwf, the third column from the file can be read
into memory using:
> col <- laf[,3]
The LaF-manual vignette contains a description of all functionality
provided:
http://laf-r.googlecode.com/files/LaF-manual_0.3.pdf
The Laf-benchmark vignette compares the performance of LaF to the
standard R-routines read.table and read.fwf:
http://laf-r.googlecode.com/files/LaF-benchmark_0.3.pdf
More information about the R-packages
mailing list