Unix tools (was: [R] read data into R with some constraints)
Prof Brian D Ripley
ripley at stats.ox.ac.uk
Fri Jan 12 19:40:01 CET 2001
On Fri, 12 Jan 2001, Clive Jenkins wrote:
>
> Thu, 11 Jan 2001 14:51:16 -0500 (EST) Jonathan Baron wrote:
> >
> > >I have a big data file (over 30,000 records) looks
> > >like this:
> > >
> > >100, 20, 46, 70
> > >103, 0, 22, 45
> > >117, -1, 34, 65
> > >120, 15, 0, 25
> > >113, 0, -1, 32
> > >142, -1, -1, 55
> > >.....
> > >
> > >I want to read only those records having positive
> > >values in all of the four
> > >columns. That is, I don't want to read record # 3, 5,
> > >and 6 into R.
> >
> > Perhaps someone knows how to do this with R, but, if
> > I had to do it right now, I would pre-process with grep
> > before reading into R:
> >
> > grep -v "-" myddata > myshorterdata
> >
> > The "grep" tool is available on Unix and Linux. If you have
> > Windows, it would be useful to get the "unix tools for windows",
> > which, unfornuately, I have just been unable to find after 10
> > minutes of searching www.gnu.org.
>
> I recommend the DJGPP (DOS/DPMI) ports of the Gnu tools available at
> http://www.delorie.com/
> They are absolute magic for anybody who is frustrated by Windows and has
> not yet moved to Gnu/Linux. I find "gawk" (or "awk") especially useful.
Can I emphatically *not* recommend those, but the genuine Windows ones at
http://www.cygwin.com instead? Particularly if you have NT/2000 and NTFS
file systems. The DJGPP tools have frustrating limitations (e.g. on line
lengths) even for Win95 users. And DJ Delorie works for Cygwin anyway.
As I posted earlier, a subset of the Cygwin tools and some others
have been collected for building R, at
http://www.stats.ox.ac.uk/pub/Rtools
We keep these up to date, but not in the (common) cases where the latest
versions are broken. What is there will have built R in the last few days.
There quite a few other Unix-alike tool sets around, but none that
I have found anything like as good.
--
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272860 (secr)
Oxford OX1 3TG, UK Fax: +44 1865 272595
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
More information about the R-help
mailing list