[R] Working with large datafiles
Thomas Lumley
tlumley at u.washington.edu
Mon Oct 4 17:42:38 CEST 2004
On Mon, 4 Oct 2004, Greg Butler wrote:
> Hi,
>
> I have been enjoying r for some time now, but was wondering about working
> with larger data files. When I try to load in big files with more than
> 20,000 records, the programs seems unbable to store all the records. Is
> there some way that I can increase the size of records that I work with?
> Ideally I would like to work with census data which can hold a million
> records.
>
You should be able to handle 20,000 records on a reasonable computer (my
laptop, with 256Mb memory can, very slowly, do survey analyses on a file
with 26,000 records and about 100 variables).
A million records is likely to be infeasible. A 32bit computer can't
even address enough memory to store that much data. You would need to put
the data either in a database or in a file format such as netCDF or hdf5
that allows smaller chunks to be read and processed.
-thomas
More information about the R-help
mailing list