[R] How would I analyse data like this?
Thomas W Blackwell
tblackw at umich.edu
Wed Mar 19 18:22:19 CET 2003
Good for you for figuring out scan(). For hist(), append one more
number to "breaks" that's guaranteed to be above all the data values.
(In the return value from hist(), "breaks" has n+1 entries when the
histogram has n bars, and it's a good guess that input=output.)
hist(data[], br=c(50*(seq(11)-1), 1000, 1+max(data[])))
For counting, try the function table(). eg:
table(data[], data[], data[])
- tom blackwell - u michigan medical school - ann arbor -
On Wed, 19 Mar 2003 laurent.duperval at microcell.ca wrote:
> I'm a new R user and I'm having a little trouble getting started. I'm hoping
> someone can help me out.
> I have data that looks like this:
> 15555551234|3|983|1000|266|IN|2003-03-16 23:57:21-05|C
> 15555552345|3|983|3000|0|IN|2003-03-16 23:58:16-05|C
> 15555552346|3|983|1000|40|IN|2003-03-16 23:58:24-05|C
> Which I've read using scan().
> data <- scan(file = "data.dat", what = list("",0,0,0,0,"","",""), sep = "|", skip = 1)
> Now, I want to do things like this:
> - A histogram for the 5th column for every 50 units. I can generate the
> histogram but most of my values are between 0-500. A few are above that.
> I'd like to bundle them all in a generic 500+ category. I can't figure out
> how. This is what I'm doing
> hist(data[], br=c(0,50,100,150,200,250,300,350,400,450,500,1000))
> Error in hist.default(data[], br = c(0, 50, 100, 150, 200, 250, 300, :
> some `x' not counted; maybe `breaks' do not span range of `x'
> - How do I count the number of times channel "IN" occurs with code = 983? How about if
> I want to combine IN and code=983 or 982 or 981?
> - Finally (for today at least) how do I count the number of times code=983 and
> date=2003-03-16 (without the time) occur. I'm hoping this will also help
> me build histograms for days of the week and for hours of the day.
> Laurent Duperval <laurent.duperval at microcell.ca>
> ZYMURGY'S FIRST LAW OF EVOLVING SYSTEM DYNAMICS
> Once you open a can of worms, the only way to recan them is to use
> a larger can.
> R-help at stat.math.ethz.ch mailing list
More information about the R-help