[R] How to change the number of bins?
David Winsemius
dw|n@em|u@ @end|ng |rom comc@@t@net
Mon Mar 11 02:39:27 CET 2019
On 3/10/19 5:29 PM, wong bowie wrote:
> You are right. Actually this variable represents the number of day
> passed after contacting a client, 999 means the client has never been
> contacted.
>
> But I am not supposed to change the value, am I?
I certainly would. SAS allows one to specify a value such as 999 to be
missing but R needs to have it changed to NA
is.na(Table$pdays) <- Table$pdays == 999
--
David
>
> David Winsemius <dwinsemius using comcast.net
> <mailto:dwinsemius using comcast.net>> 於 2019年3月10日 週日 下午10:48寫道:
>
> Seems rather likely that 999 is not really a measured value but
> rather
> is a missing value indicator.
>
>
> --
>
> David.
>
> On 3/10/19 1:54 PM, wong bowie wrote:
> > I wish to calculate the weight of evidence of a variable x, which is
> > positively skewed, with over 6000 of the observations are 999
> but only 200
> > range from 1-27. I used the code,
> >
> > “IV<-create_infotables(data=Test[,-1],y="class",bins=10)”
> >
> > However, no matter what number I used in bins parameter, I can
> only get 2
> > bins, [1,27] and [999,999]. Is there any way I can look into the
> [1,27]
> > closely because they represent a lot? The output from R is shown
> below,
> >
> > Table$pdays
> > pdays N Percent WOE IV
> > 1 [1,27] 243 0.03807584 2.6743166 0.5267751
> > 2 [999,999] 6139 0.96192416 -0.2230081 0.5707022
> >
> > Thank you very much!!
> >
> > [[alternative HTML version deleted]]
> >
> > ______________________________________________
> > R-help using r-project.org <mailto:R-help using r-project.org> mailing list
> -- To UNSUBSCRIBE and more, see
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> > and provide commented, minimal, self-contained, reproducible code.
>
[[alternative HTML version deleted]]
More information about the R-help
mailing list