[R] Binning or grouping data

Glen Sargeant gsargeant at usgs.gov
Thu Jun 4 17:23:05 CEST 2009




alamoboy wrote:
> 
> Newbie here.  Many apologies in advance for using the incorrect lingo. 
> I'm new to statistics and VERY new to R.
> 
> I'm attempting to "group" or "bin" data together in order to analyze them
> as a combined group rather than as discrete set.  I'll provide a simple
> example of the data for illustrative purposes.
> 
> Patient ID  |  Charges   |    Age  |   Race
> 1              |  100          |    0     |   Black
> 2              |  500          |    3     |   White
> 3              |  200          |    5     |   Hispanic
> 4              |   90           |    7     |   Asian
> 5              |    400        |   10     |   Hispanic 
> 6              |    500        |   16     |   Black
> 
> I'm trying to create three age categories--"0 to 4", "5 to 11" and "12 to
> 17"--and analyze their "Charges" by their "Race."  How do I go abouts to
> doing this?  
> 
> Thanks for any assistance!
> 
> 
> Sam
> 
> 
> 

Sam,

In addition to functions mentioned by other respondents, you may wish to
investigate findInterval(), which returns indices of bins.  The resulting
indices are very useful for subscripting as well as grouping.

> id
[1] 1 2 3 4 5 6

> age
[1]  0  3  5  7 10 16

> group <- findInterval(age,breaks)

> group
[1] 1 1 3 3 3 5

> data.frame(id,age,group)
  id age group
1  1   0     1
2  2   3     1
3  3   5     3
4  4   7     3
5  5  10     3
6  6  16     5

Glen


-- 
View this message in context: http://www.nabble.com/Binning-or-grouping-data-tp23864555p23872151.html
Sent from the R help mailing list archive at Nabble.com.




More information about the R-help mailing list