[R-sig-ME] reference on unidentifiability of, observation-level variance in Bernoulli variables?
Highland Statistics Ltd
highstat at highstat.com
Wed Nov 16 14:39:30 CET 2011
Message: 2
Date: Tue, 15 Nov 2011 15:53:07 -0500
From: Ben Bolker<bbolker at gmail.com>
To: r-sig-mixed-models<r-sig-mixed-models at r-project.org>
Subject: [R-sig-ME] reference on unidentifiability of
observation-level variance in Bernoulli variables?
Message-ID:<4EC2D133.6040004 at gmail.com>
Content-Type: text/plain; charset=ISO-8859-1
Hi folks,
It has often been discussed on this list and other R-help lists that
overdispersion is (broadly speaking) unidentifiable for an ungrouped
Bernoulli response variable (if some kind of grouping can be imposed,
then it becomes identifiable). For example:
https://stat.ethz.ch/pipermail/r-help/2008-February/154058.html
http://finzi.psych.upenn.edu/R/Rhelp02a/archive/91242.html
Peter Dalgaard:
> The point being that you cannot have a distribution on {0, 1} where>
the variance is anything but p(1-p) where p is the mean; if you put> a
distribution on p and integrate it out, you still end up with the>
same variance.
I am curious if anyone has a *printed* (book or peer-reviewed
article) for this, or even can point to notes that actually go to the
trouble of doing the integration and proving the statement. I have
looked in some of the usual places (MASS; McCullough, Searle, and
Neuhaus; Zuur et al.) and haven't come across anything.
Something that showed the computation of the intraclass correlation
and worked out the mean of the logistic-normal-binomial distribution as
a function of the mean and variance of the underlying normal
distribution would be nice too, although I'm guessing it doesn't have a
straightforward analytical solution ...
Ben,
An expression for the intraclass correlation in a Poisson or binomial GLMM can be found in:
Goldstein et al. (2002)
Browne et al. (2005)
For a Poisson GLMM the calculations are simple. For a Binomial GLMM you need to do approximations or simulations.
For a 2-way nested ZIP model the intraclass correlation is given in Chapter 4 of:
Zero Inflated Models and Generalized Linear Mixed Models with WinBUGS and R (2012). Zuur et al.
Alain
Refs:
Goldstein H, Browne W, Rasbash J (2002). Partitioning Variation in Multilevel Models. Understand-ing Statistics. Understanding Statistics, 1(4), 223-231
Browne WJ, Subramanian SV, Jones K (2005). Variance partitioning in multilevel logistic models that exhibit overdispersion. J. R. Statist. Soc. A (2005) 168, Part 3, pp. 599–613
I'm hoping I haven't missed anything obvious -- on the other hand, if
I have it will be easy for someone to answer (and please don't be
offended if it was in your book, which was on my shelf all the time and
I forgot to look there ...)
thanks
Ben Bolker
More information about the R-sig-mixed-models
mailing list