[R] how to bread while loop reading from connection with read.csv
Berend Hasselman
bhh at xs4all.nl
Mon Jan 21 17:34:27 CET 2013
On 21-01-2013, at 16:56, "Collins, Stephen" <Stephen.Collins at allstate.com> wrote:
> Hello,
>
> I'm trying to read a file rows at a time, so as to not read the entire file into memory. When reading the "connections" and "readLines" help, and "R help archive," it seems this should be possible with read.csv and a file connection, making use of the "nrows" argument, and checking where the "nrow()" of the new batch is zero rows.
>
>> From certain posts, it seemed that read.csv should return "character(0)" when the end of file is reached, and there are no more rows to read. Instead, I get an error there are "no lines available for input." Have I made a mistake with the file, or calling read.csv?
>
> What is the proper way to check the end-of-file condition with read.csv, such that I could break a while loop reading the data in?
>
> #example, make a test file
> con <- file("test.csv","wt")
> cat("a,b,c\n", "1,2,3\n", "4,5,6\n", "7,6,5\n", "4,3,2\n", "3,2,1\n",file=con)
> unlink(con)
>
> #show the file is valid
> con <- file("test.csv","rt")
> read.csv(con,header=T)
> unlink(con)
>
> #show that readLines ends with "character(0)", like expected
> con <- file("test.csv","rt")
> readLines(con,n=10)
> readLines(con,n=10)
> unlink(con)
>
> #show that read.csv end with error
> con <- file("test.csv","rt")
> read.csv(con,header=T,nrows=10)
> read.csv(con,header=F,nrows=10)
> unlink(con)
>
How about:
con <- file("test.csv","rt")
while( length(tmp <- readLines(con,n=10)) > 0 ) {
qq <- read.csv(text=tmp, header=TRUE)
# do something with qq
}
unlink(con)
qq
Berend
More information about the R-help
mailing list