[R] Tools for data preparation?
Liaw, Andy
andy_liaw at merck.com
Fri Nov 19 12:39:18 CET 2004
My choices are (in the order of my preference):
- use connections and readLines()/strsplit()/etc. in R to process the file a
chunk at a time
- use cut/paste/grep/etc., perhaps within pipe() in R
- use awk, perhaps within pipe() in R
- Python is my last resort, as I'm not familiar with it
The first preference is to do it all in R, mostly for the reason that I can
keep track of what was done all in one place (the R script or function).
Andy
> From: David Mitchell
>
> Hello list,
>
> I'm regularly in the position where I have to do a lot of data
> manipulation, in order to get the data I have into a format R is happy
> with. This manipulation would generally be in one of two forms:
> - getting data from e.g. text log files into a tabular format
> - extracting sensible sample data from a very large data set (i.e. too
> large for R to handle)
>
> In general, I use Perl or Python to do the task; I'm curious as to
> what others use when they hit the same problem.
>
> Regards
>
> Dave Mitchell
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide!
> http://www.R-project.org/posting-guide.html
>
>
More information about the R-help
mailing list