[Rd] Apparant bug in binomial model in GLM (PR#13434)
Peter Dalgaard
P.Dalgaard at biostat.ku.dk
Wed Jan 7 16:48:03 CET 2009
Charles Geyer wrote:
...
> BTW the particular example given doesn't make clear WHAT question cannot
> be answered correctly. Some questions can be without fuss, for example
>
>> y <- c(0,0,0,0,0,1,1,1,1,1)
>> x <- seq(along = y)
>> out1 <- glm(y ~ x, family = binomial)
> Warning messages:
> 1: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart = etastart, :
> algorithm did not converge
> 2: In glm.fit(x = X, y = Y, weights = weights, start = start, etastart = etastart, :
> fitted probabilities numerically 0 or 1 occurred
>> out0 <- glm(y ~ 1, family = binomial)
>> anova(out0, out1, test = "Chisq")
> Analysis of Deviance Table
>
> Model 1: y ~ 1
> Model 2: y ~ x
> Resid. Df Resid. Dev Df Deviance P(>|Chi|)
> 1 9 13.8629
> 2 8 7.865e-10 1 13.8629 0.0002
>
> This P-value (P = 0.0002) is valid, because the MLE does exist for the null
> hypothesis. Hence we see that we have to use the model y ~ x for which
> the MLE does not exist in the conventional sense.
It may be valid in some senses, but I can't help notice that it is off
by a factor of at least 10, since the experiment has only 1024 outcomes,
two of which are as extreme as the one observed, and where all outcomes
are equally likely under the corresponding y~1 model.
-p
--
O__ ---- Peter Dalgaard Øster Farimagsgade 5, Entr.B
c/ /'_ --- Dept. of Biostatistics PO Box 2099, 1014 Cph. K
(*) \(*) -- University of Copenhagen Denmark Ph: (+45) 35327918
~~~~~~~~~~ - (p.dalgaard at biostat.ku.dk) FAX: (+45) 35327907
More information about the R-devel
mailing list