[R] big data
Dirk Eddelbuettel
edd at debian.org
Wed Sep 8 14:30:41 CEST 2010
On 8 September 2010 at 13:26, André de Boer wrote:
| I searched the internet but i didn't find the answer for the next problem:
| I want to do a glm on a csv file consisting of 25 columns and 4 mln rows.
| Not all the columns are relevant. My problem is to read the data into R.
| Manipulate the data and then do a glm.
|
| I've tried with:
|
| dd<-scan("myfile.csv",colClasses=classes)
| dat<-as.data.frame(dd)
|
| My question is: what is the right way to do is?
| Can someone give me a hint?
Look at the biglm package by Thomas Lumley which will allow you to fit glm
models in "chunks".
Dirk
--
Dirk Eddelbuettel | edd at debian.org | http://dirk.eddelbuettel.com
More information about the R-help
mailing list