[R-sig-ME] lme4, cloglog vs. binomial link (peter dalgaard)

peter dalgaard pdalgd at gmail.com
Fri Jun 8 12:52:08 CEST 2012


Hi Murray,

I think this is pretty strongly related to proportional hazards modelling. If you are looking at it from a Poisson (*) process point of view, the rate of event per unit of time when observing a number of independent processes should be proportional to time and the number  of processes, and the probability of at least one event in a fixed length of time T is then 1 - exp(- n T lambda) (or 1 - exp (-n Lambda(T)) if you have a time-varying intensity, Lambda being the integrated intensity).

-pd

(*) Could be fun if this was actually about fish...

On Jun 6, 2012, at 23:21 , Murray Jorgensen wrote:

> *Hi Peter, Tibor et al.
> 
> I came across an ecological situation recently where a cloglog link seemed
> to be called for. I won't remove the context to the following explanatory
> note that I wrote but I'm sure the same kind of situation could be
> reasonably common:
> 
> 
> We wish to explore the probability of one or more females departing a
> cavity between two site visits as a function of the habitation state of the
> cavity at the first visit. More strictly we study the probability of a
> decrease in the number of females inhabiting the cavity between the two
> visits. Clearly this probability will be zero if no females inhabit the
> cavity at the first visit. More generally the probability will be larger as
> the number of female inhabitants increase as each has the opportunity to
> depart.
> 
> Although this dependance on the initial number of females is part of what
> we want to study we are more interested in questions such as the influence
> of the initial number of males on the probability of female decrease. We
> are indeed also interested in the effect of the initial numbers of females
> on the probability of female decrease, but more in the sense that we would
> like to know whether this is greater than, less than or equal to what would
> be predicted by a simple model.
> 
> One naive model that could be considered is that the decrease probability
> would be proportional to the number of females. This might work if the
> decrease probability was very low but for larger decrease probabilities
> would predict decrease probabilities greater than one. A less naive model
> would assume that each female departs with the same probability,
> independently of the other females.
> Then if the probability of a single female departing is $p$ and there are
> $x$ females in the cavity the probability of 1 or more departing is $p_x =
> 1 - (1-p)^x$.
> 
> The link function for the complementary log-log link is $\eta =
> \log(-\log(1-p))$. To examine the effect of multiple initial females we
> evaluate this at $p_x$.
> 
>                 1-p_x  =  (1-p)^x
>           \log(1-p_x)  =  x\log(1-p)
>    \log(-\log(1-p_x))  =  \log(x)+\log(-\log(1-p))
> 
> Thus the effect of an initial habitation of $x$ females is a shift of
> $\log(x)$ on the linear predictor scale if a complementary log-log link is
> used in a GLM or GLMM for the probability of female decrease. This means
> that the naive model can be accommodated by including $\log(x)$ as an
> offset.  If $x$ were also included as a covariate, a significant
> coefficient would indicate a departure from the naive model.
> 
> Regards, Murray
> 
> *
> 
>> Message: 5
>> Date: Wed, 6 Jun 2012 22:54:16 +0200
>> From: peter dalgaard <pdalgd at gmail.com>
>> To: Tibor Kiss <tibor at linguistics.rub.de>
>> Cc: r-sig-mixed-models at r-project.org
>> Subject: Re: [R-sig-ME] lme4, cloglog vs. binomial link
>> Message-ID: <7658F572-AEE2-4127-AB77-321B5B6C3D69 at gmail.com>
>> Content-Type: text/plain; charset=us-ascii
>> 
>> 
>> On Jun 4, 2012, at 13:07 , Tibor Kiss wrote:
>> 
>>> [...snippage...]
>>> My questions are as follows:
>>> 
>>> 1. Is it correct to assume that given a cloglog link, the less frequent
>> response should be considered the success?
>> 
>> No, cloglog is asymmetric, so it will make a difference which outcome is
>> considered success, but there is no mathematical reason to choose between
>> them. In survival data, the cloglog comes out of the proportional hazards
>> model when you have death within a fixed time period as the response (exact
>> date of death not recorded). In that case, death is "success" (!);
>> hopefully, it is the least likely outcome, but it might not be. If cloglog
>> is just used as a generic link function, then no such logic applies.
>> 
>>> 2. Is it correct to conclude that the changes in the model have led to
>> less influence of the random factor?
>> 
>> No. The scales are different. At the very least, you need to somehow
>> compare it to the fixed effects on the same scale.
>> 
>>> 3. How shall I react to the increase in AIC?
>> 
>> (Or, equivalently, the deviance). The cloglog link model seems to give the
>> worse fit to data.
>> 
>>> A final question, which may not have an answer at all: I am most curious
>> to learn about possible modifications of the model so that an observed
>> random effect can be minimized (while its presence cannot be denied).
>> 
>> First, is that desirable, and why? The only logic, that I can think of, is
>> that you want to get the fixed-effect part of the model right, so that the
>> error is not mistakenly taken as part of the random variation.
>> 
>> --
>> Peter Dalgaard, Professor,
>> Center for Statistics, Copenhagen Business School
>> Solbjerg Plads 3, 2000 Frederiksberg, Denmark
>> Phone: (+45)38153501
>> Email: pd.mes at cbs.dk  Priv: PDalgd at gmail.com
>> 
>> 
>> 
>> ------------------------------
>> 
>> _______________________________________________
>> R-sig-mixed-models mailing list
>> R-sig-mixed-models at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
>> 
>> 
>> End of R-sig-mixed-models Digest, Vol 66, Issue 10
>> **************************************************
>> 
> 
> 
> 
> -- 
> Dr Murray Jorgensen      http://www.stats.waikato.ac.nz/Staff/maj.html
> Department of Statistics, University of Waikato, Hamilton, New Zealand
> Email: maj at waikato.ac.nz    majorgensen at ihug.co.nz      Fax 7 838 4155
> Phone  +64 7 838 4773 wk    Home +64 7 825 0441   Mobile 021 0200 8350
> 
> 	[[alternative HTML version deleted]]
> 
> _______________________________________________
> R-sig-mixed-models at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

-- 
Peter Dalgaard, Professor
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Email: pd.mes at cbs.dk  Priv: PDalgd at gmail.com



More information about the R-sig-mixed-models mailing list