[R] how to read this kind of csv in R?
Duncan Murdoch
murdoch@dunc@n @end|ng |rom gm@||@com
Sun Oct 6 14:08:41 CEST 2019
On 06/10/2019 7:29 a.m., vod vos via R-help wrote:
> I got hundreds of csv files. The real formats in each csv file are as follows:
>
> aa(cm)
> 1, 2 , 3,
>
> bb(mm)
> 1, 2, 3,
> 4, 5, 6,
> 7, 8, 9,
>
> cc(mm)
> 3, 4, 5,
> 7, 5, 9,
> 6, 5, 8,
>
> How can I use read.table or read.csv to convert the csv files
> to a tidy data frame format as follow:
>
> aa, bb, cc
> 1, 1, 3
> 1, 2, 4
> 1, 3, 5
> 2, 4, 7
> 2, 5, 5
> 2, 6, 9
> 3, 7, 6
> 3, 8, 5
> 3, 9, 8
>
> many thanks.
You'll need more than those two functions to do the transformation you
want. To work out what you need, write out the process in detail in
English (or another natural language), not in code. For example:
1. Read aa from file 1.
2. Read bb from file 2.
3. Read cc from file 3.
4. Expand all vectors to the same length.
5. Combine them into a single dataframe.
Then work out each step separately. I think you'll want to use
something like scan("filename", skip = 1, sep = ",") in steps 1, 2, and
3, but this will add NA values at the end of each line because of the
final comma, so you could do this:
aa <- scan("file1", skip = 1, sep = ",")
aa <- aa[!is.na(aa)]
and similarly for the others.
I don't know the rules for expanding that you'll need in your real data,
but for your example step 4 could be
aa <- rep(aa, each = 3)
Then step 5 could be
result <- data.frame(aa, bb, cc)
Duncan Murdoch
More information about the R-help
mailing list