[R] proportions confidence intervals
Spencer Graves
spencer.graves at pdf.com
Mon Jul 12 19:44:25 CEST 2004
Please see:
Brown, Cai and DasGupta (2001) Statistical Science, 16: 101-133
and (2002) Annals of Statistics, 30: 160-2001
They show that the actual coverage probability of the standard
approximate confidence intervals for a binomial proportion are quite
poor, while the standard asymptotic theory applied to logits produces
rather better answers.
I would expect "confint.glm" in library(MASS) to give decent
results, possibly the best available without a very careful study of
this particular question. Consider the following:
library(MASS)# needed for confint.glm
library(boot)# needed for inv.logit
DF10 <- data.frame(y=.1, size=10)
DF100 <- data.frame(y=.1, size=100)
fit10 <- glm(y~1, family=binomial, data=DF10, weights=size)
fit100 <- glm(y~1, family=binomial, data=DF100, weights=size)
inv.logit(coef(fit10))
(CI10 <- confint(fit10))
(CI100 <- confint(fit100))
inv.logit(CI10)
inv.logit(CI100)
In R 1.9.1, Windows 2000, I got the following:
> inv.logit(coef(fit10))
(Intercept)
0.1
>
> (CI10 <- confint(fit10))
Waiting for profiling to be done...
2.5 % 97.5 %
-5.1122123 -0.5258854
> (CI100 <- confint(fit100))
Waiting for profiling to be done...
2.5 % 97.5 %
-2.915193 -1.594401
>
> inv.logit(CI10)
2.5 % 97.5 %
0.005986688 0.371477058
> inv.logit(CI100)
2.5 % 97.5 %
0.0514076 0.1687655
>
> (naiveCI10 <- .1+c(-2, 2)*sqrt(.1*.9/10))
[1] -0.08973666 0.28973666
> (naiveCI100 <- .1+c(-2, 2)*sqrt(.1*.9/100))
[1] 0.04 0.16
hope this helps. spencer graves
Darren Shaw wrote:
> Dear R users
>
> this may be a simple question - but i would appreciate any thoughts
>
> does anyone know how you would get one lower and one upper confidence
> interval for a set of data that consists of proportions. i.e. taking
> a usual confidence interval for normal data would result in the lower
> confidence interval being negative - which is not possible given the
> data (which is constrained between 0 and 1)
>
> i can see how you calculate a upper and lower confidence interval for
> a single proportion, but not for a set of proportions
>
> many thanks
>
>
> Darren Shaw
>
>
>
> -----------------------------------------------------------------
> Dr Darren J Shaw
> Centre for Tropical Veterinary Medicine (CTVM)
> The University of Edinburgh
> Scotland, EH25 9RG, UK
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide!
> http://www.R-project.org/posting-guide.html
More information about the R-help
mailing list