[R-sig-ME] Question on mixed models, pseudoreplication and inflated degrees of freedom

Thu Aug 1 11:17:48 CEST 2013

Dear Tom,

Having SUGAR both in the fixed and the random effects in not relevant. The variance of the SUGAR random effect should be zero. Assuming each CLONE has a unique ID, you don't need the nesting either. The random effect of CLONE will model the effect of each clone while taking into account the fixed effects (which include SUGAR).

The A + B + A * B syntax is verbose. A * B will do. If you prefer to write the model explicit then I recommend using the A + B + A:B notation. A * B is shorthand for A + B + A : B

You might want to add an observation level random effect to capture potential overdispersion. Look carefully at the variance of this random effect. If it's small you should be ok. When it gets higher (above 1 or 2) you have quite strong overdispersion. In that case you better switch to a model that uses a more appropriate family (e.g. negative binomial).

data$ID <- seq_len(nrow(data))

So the model should be

M0 <- glmer(FITNESS ~  ANT_TENDED * SUGAR + C + (1|PLANT) + (1|CLONES) + (1|ID), family = poisson, data = data)

Or if the clones are sometime tended and sometimes not

glmer(FITNESS ~  ANT_TENDED * SUGAR + C + (1|PLANT) + (0 + ANT_TENDED|CLONES) + (1|ID), family = poisson, data = data)

Likelihoodratio tests should be ok.

M0 <- glmer(FITNESS ~  ANT_TENDED * SUGAR + C + (1|PLANT) + (1|CLONES) + (1|ID), family = poisson, data = data)
M1 <- glmer(FITNESS ~  ANT_TENDED + SUGAR + C + (1|PLANT) + (1|CLONES) + (1|ID), family = poisson, data = data)
anova(M0, M1)

Best regards,

ir. Thierry Onkelinx
Instituut voor natuur- en bosonderzoek / Research Institute for Nature and Forest
team Biometrie & Kwaliteitszorg / team Biometrics & Quality Assurance
Kliniekstraat 25
1070 Anderlecht
Belgium
+ 32 2 525 02 51
+ 32 54 43 61 85
Thierry.Onkelinx op inbo.be
www.inbo.be

To call in the statistician after the experiment is done may be no more than asking him to perform a post-mortem examination: he may be able to say what the experiment died of.
~ Sir Ronald Aylmer Fisher

The plural of anecdote is not data.
~ Roger Brinner

The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from a given body of data.
~ John Tukey

-----Oorspronkelijk bericht-----
Van: r-sig-mixed-models-bounces op r-project.org [mailto:r-sig-mixed-models-bounces op r-project.org] Namens Tom Wenseleers
Verzonden: donderdag 1 augustus 2013 2:11
Aan: r-sig-mixed-models op r-project.org
Onderwerp: [R-sig-ME] Question on mixed models, pseudoreplication and inflated degrees of freedom

Dear all,
I would appreciate any feedback on the following experimental setup and especially regarding a referee comment that suggested we were using inflated degrees of freedom and that our experiment suffered from pseudoreplication. So my question is whether the mixed model we use adequately takes into account different sources of dependency in our data, and hence resolves potential pseudoreplication.

The setup is with a clonal aphid species of which different genetic lines ( variable Clones, n=8) were used. Those lines could be divided into two groups for a certain characteristic (amount of SUGAR produced in their honeydew - Low or High). The measurements of the dependent variable (FITNESS, nr of aphids present 1 week after inoculating a plant) were repeated 4 times per clone on different plants (PLANT). Finally, there was one other factor with 2 categories (ANT TENDED - Yes or No, describing whether the aphid colonies were tended by ants or not), and a numerical covariate we would like to correct for (C).

We then used the following GLMM with poisson error structure
glmer(FITNESS~SUGAR+ANT_TENDED+ANT_TENDED*SUGAR+C+(1|PLANT)+(1|SUGAR/CLONES),family=poisson,data=data)

Significance of the fixed effects was then tested using Wald type tests and likelihood ratio tests.

My question is whether this model adequately captures our error structure?
Furthermore, are inflated degrees really an an issue with mixed models and how should this be reported?
I suppose this is only relevant in an ANOVA type analysis, right, but not in a mixed modelling context, since the likelihood ratio tests have 1 df anyway?
Or would it indeed apply if one assessed significance using an anova type methodology, e.g. using lmertest with Sattertwaithe or Kenward Roger df approximations?

Specific comments we received were:
"From the presented results we have no idea whether effects of melezitose type have indeed been tested over the right model (in a GLMM type analysis) or error term (clone within melezitose type), which actually should have quite low df in these experiments (in an ANOVA type analysis)."

"All scores used in the statistical tests should be averaged for four high-melezitose clones and four low-melezitose ones."
(This doesn't make sense, right, as that would represent a huge loss of information?)

"Because measurements on different aphids on the same plant are not independent, how did you account for this in the analysis?"
(This was covered through our inclusion of an observation (plant) level random factor, right?)

Would any of you be able to advise me on this by any chance?

Yours sincerely,
Tom Wenseleers

_______________________________________________________________________________________

Prof. Tom Wenseleers
P      Lab. of Socioecology and Social Evolution
           Dept. of Biology
           Zoological Institute
           K.U.Leuven
           Naamsestraat 59, box 2466
           B-3000 Leuven
           Belgium
  +32 (0)16 32 39 64 / +32 (0)472 40 45 96 //tom.wenseleers op bio.kuleuven.be http://bio.kuleuven.be/ento/wenseleers/twenseleers.htm

_______________________________________________
R-sig-mixed-models op r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
* * * * * * * * * * * * * D I S C L A I M E R * * * * * * * * * * * * *
Dit bericht en eventuele bijlagen geven enkel de visie van de schrijver weer en binden het INBO onder geen enkel beding, zolang dit bericht niet bevestigd is door een geldig ondertekend document.
The views expressed in this message and any annex are purely those of the writer and may not be regarded as stating an official position of INBO, as long as the message is not confirmed by a duly signed document.