[R] Setting breaks to data more appropriately
David Winsemius
dwinsemius at comcast.net
Tue Mar 23 23:23:18 CET 2010
On Mar 23, 2010, at 5:23 PM, ROLL Josh F wrote:
> David
> Thanks you for the guidance. I am looking to drive the data into
> the intervals. Problem now is that I have a few 0 values that are
> returning as NA (e.g -> (0.1,0.2] (0.1,0.2] (0.1,0.2] <NA>
> (0,0.1] )
> This is causing issues when I try and use the brks to inform a
> clorepleth map. I tried setting all na's to 0 but because the
> dataset is in factor form (I guess this is why) it wont allow me to
> do so. Thoughts about how to handle the zero values? Thanks
> JR
>
?cut
By default include.lowest is set to FALSE. Setting it to TRUE should
cure your missing zeroes problem. (My own feeling is that was a very
poor choice of defaults but I guess it is cast in stone now.)
You don't say how you tried setting NA's to zero (and doing so is not
a good idea if there are real zeroes in the data), but the is.na and
is.na<- functions are often needed for that purpose.
--
David.
> -----Original Message-----
> From: David Winsemius [mailto:dwinsemius at comcast.net]
> Sent: Monday, March 22, 2010 12:46 PM
> To: ROLL Josh F
> Cc: r-help at r-project.org
> Subject: Re: [R] Setting breaks to data more appropriately
>
>
> On Mar 22, 2010, at 1:49 PM, LCOG1 wrote:
>
>>
>> Basic question. For the below data, i would like to but each of the
>> values in a bin that represents their value. So the below would
>> hopefully put .1 in the 0-.1 bin, .2 in the .11-.2 bin and so forth.
>> The outlying values would then be put into and outer category
>> representing everything
>>> 1. Im
>> using the breaks to inform some code for making a clorepleth map that
>> represents probabilities, which in some cases IS greater than 1
>
> ... not if it's a quantile or a probability.
>
>> and i need
>> to identify those better.
>
> Define "better".
>
>> As my code stands now, my real data is put put into this form when
>> brks is called:
>>
>> 0% 10% 20% 30% 40%
>> 50% 60%
>> 0.00000000 0.05054675 0.07787235 0.11235238 0.14424786 0.18089360
>> 0.21475990
>> 70% 80% 90% 100%
>> 0.26309899 0.30807771 0.39478573 0.67573483.
>>
>> But what i want is for the values to be placed in bins corresponding
>> to their value(0-.1, .11-.2, .21-.3 etc)
>>
>> Pct.SFD<-c(.1,.2,.3,.4,.5,.6,.7,.8,.9,1,2,3)
>> brks <- quantile(Pct.SFD, )
>>
>> I think this is clear.
>
> It's not. You need to decide whether you want the breaking to be
> driven by you or by the data. If you are doing the driving then use
>
> cut(object, breaks=c(seq(0,1, by=0.1), Inf) , right=TRUE)
>
> If the data is doing the driving then:
>
> cut(object, breaks=quantile(object, probs= seq(0,1,1/10 ) ) ,
> right=TRUE)
>
> --
> David.
>
>> Thanks
>> --
>> View this message in context:
>> http://n4.nabble.com/Setting-breaks-to-data-more-appropriately-
>> tp16780
>> 19p1678019.html Sent from the R help mailing list archive at
>> Nabble.com.
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
>
David Winsemius, MD
West Hartford, CT
More information about the R-help
mailing list