[R] Count based on 2 conditions [Beginner Question]
David Winsemius
dwinsemius at comcast.net
Sun Sep 16 23:36:26 CEST 2012
On Sep 16, 2012, at 3:41 AM, SirRon wrote:
> Hello,
> I'm working with a dataset that has 2 columns and 1000 entries. Column 1 has
> either value 0 or 1, column 2 has values between 0 and 10. I would like to
> count how often Column 1 has the value 1, while Column 2 has a value greater
> 5.
>
> This is my attempt, which works but doesn't seem to be very efficient,
> especially when testing different values or columns.
>
> count=0
> for (i in 1:1000) { if(dataset[i,2]>5 && ind[i,1]==1) { count=count+1}}
>
> I'm looking for a more efficient/elegant way to do this!
>
I see others have given you a solution using the vectorized sum function. I would have reached for 'table' and done it thusly:
table( one=dataset[,1], GT5=dataset[ , 2] > 5 )
--
David Winsemius, MD
Alameda, CA, USA
More information about the R-help
mailing list