[R] a weird "cut" question

William Dunlap wdunlap at tibco.com
Thu Dec 8 04:32:49 CET 2011


You could use quantile() to create the breakpoints, so that each
interval holds about the same number of observations:
  > x <- c(47677, 47602, 47678, 47905, 47909, 47906, 47605, 47673, 47607)
  > cutX <- cut(x, breaks=quantile(x, (0:3)/3), include.lowest=TRUE)
  > cutX
  [1] (4.77e+04,4.78e+04] [4.76e+04,4.77e+04] (4.77e+04,4.78e+04] (4.78e+04,4.79e+04]
  [5] (4.78e+04,4.79e+04] (4.78e+04,4.79e+04] [4.76e+04,4.77e+04] (4.77e+04,4.78e+04]
  [9] [4.76e+04,4.77e+04]
  Levels: [4.76e+04,4.77e+04] (4.77e+04,4.78e+04] (4.78e+04,4.79e+04]
  > table(cutX)
  cutX
  [4.76e+04,4.77e+04] (4.77e+04,4.78e+04] (4.78e+04,4.79e+04] 
                  3                   3                   3
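
To see why the two calls differ, here is a minimal sketch that puts them
side by side. It assumes nothing beyond base R; the names equalWidth and
equalCount are only for illustration, and dig.lab = 5 is added so the
interval labels are not rounded to three significant digits.

```r
x <- c(47677, 47602, 47678, 47905, 47909, 47906, 47605, 47673, 47607)

# Equal-width bins: breaks = 3 splits range(x) into three intervals
# of equal length, regardless of where the observations fall.
equalWidth <- cut(x, breaks = 3, dig.lab = 5)

# Equal-count bins: breakpoints at the 0, 1/3, 2/3 and 1 quantiles of x.
equalCount <- cut(x, breaks = quantile(x, (0:3)/3), include.lowest = TRUE,
                  dig.lab = 5)

table(equalWidth)  # the middle interval is empty for these data
table(equalCount)  # three observations per interval
```

The data sit in two clusters near the ends of the range, so the middle
equal-width interval catches nothing, while the quantile breakpoints
follow the data.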

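The quantile() approach will fail if there are only 2 distinct values in
the dataset: the breakpoints can then coincide, and cut() stops with an
error when its breaks are not unique.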
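
The failure with two distinct values can be reproduced as below. This is
only a sketch: x2 is made-up data, and cutQuantiles() is a hypothetical
helper (not part of base R) showing one possible workaround, which is to
drop duplicated breakpoints and accept fewer levels than requested.

```r
x2 <- c(1, 1, 1, 2, 2, 2)                  # only two distinct values
b  <- quantile(x2, (0:3)/3, names = FALSE)
anyDuplicated(b) > 0                       # TRUE: some breakpoints coincide

# cut() refuses duplicated breaks; capture the error message
# instead of stopping the script.
msg <- tryCatch(cut(x2, breaks = b, include.lowest = TRUE),
                error = function(e) conditionMessage(e))

# Hypothetical helper: keep only the distinct quantile breakpoints.
cutQuantiles <- function(x, n = 3) {
  b <- unique(quantile(x, (0:n)/n, names = FALSE))
  if (length(b) < 2) stop("x needs at least two distinct values")
  cut(x, breaks = b, include.lowest = TRUE)
}

nlevels(cutQuantiles(x2))                  # fewer than the 3 levels asked for
```

Whether silently returning fewer levels is acceptable depends on what the
groups are used for; stopping with an error may be the safer default.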

Bill Dunlap
Spotfire, TIBCO Software
wdunlap tibco.com 

> -----Original Message-----
> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-project.org] On Behalf Of Erin Hodgess
> Sent: Wednesday, December 07, 2011 7:15 PM
> To: R help
> Subject: [R] a weird "cut" question
> 
> Dear R People:
> 
> I have the following data:
> 
> > ail.df[,1]
> [1] 47677 47602 47678 47905 47909 47906 47605 47673 47607
> > cut(ail.df[,1],breaks=3)
> [1] (4.76e+04,4.77e+04] (4.76e+04,4.77e+04] (4.76e+04,4.77e+04]
> [4] (4.78e+04,4.79e+04] (4.78e+04,4.79e+04] (4.78e+04,4.79e+04]
> [7] (4.76e+04,4.77e+04] (4.76e+04,4.77e+04] (4.76e+04,4.77e+04]
> Levels: (4.76e+04,4.77e+04] (4.77e+04,4.78e+04] (4.78e+04,4.79e+04]
> >
> So I have cut ail.df[,1] into 3 levels.  However, the second level
> never appears in the data set.
> 
> Is there a way to set cut such that every level appears, please?
> 
> thanks in advance,
> Sincerely,
> Erin
> 
> 
> --
> Erin Hodgess
> Associate Professor
> Department of Computer and Mathematical Sciences
> University of Houston - Downtown
> mailto: erinm.hodgess at gmail.com
> 


