[R-sig-ME] multiple variance structure in lmer giving zero variances
Ben Bolker
bbolker at gmail.com
Mon Jun 4 11:02:10 CEST 2012
Sara Krause <skkrause at ...> writes:
> I apologize for the cross-post. posted to the regular list serve
> first and received a suggestion from someone else that I post here
> instead. ... I should have been clear originally that I had posted
> on the regular listserve first.
Not a big deal, now you know.
> Your response was very useful in helping me move along in this analysis. I
> have addressed several of your comments in more detail below but to
> summarize my email, these are my current questions:
>
> 1. Is it possible for me to still use the nlme package? If so, do I use
> the pdBlocked structure to incorporate my random effects? And if I do use
> this structure, could someone point me towards a resource to figure out how
> to create my groupedData? I'm sure my lack of understanding here is a
> statisitical, rather than R code issue.
I don't think groupedData is absolutely necessary. I'll leave the
pdBlocked question for someone else ...
> 2. It is unclear to me how to deal with the problem of having my
> categorical variables occur in the fixed and random effects. Any insight
> on this would be helpful.
> 3. Based on the paper you sent and a look at my data, it does seem like I
> need to incorporate the trt interactions but this exacerbates the problem
> of an overfitted model.
Yes, but in this case you just can't -- because each individual only
gets a single treatment. So the interaction might be there, but you
can't estimate it from your data (which gets you off the hook).
> 4. I tried the model that you kindly provided and received an error. See
> below for details.
> I was switching to lme4 because I didn't think that I could use the correct
> random terms in nlme. I would actually prefer to stick with nlme if it is
> possible to do so. I have a few simpler models that I have already run in
> nlme so it would be nice to not have to re-run all of them. I understand
> the p-values provided are a bit controversial but journals like them anyway
> so they are nice to have.
> My best guess for how to do this based on what I could figure out and your
> suggested lmer model is the following
>
> m1<-lme(y=trt*state*season*site, data=group,
> random=pdBlocked(list(pdIdent(~state|ID), pdIdent(~season|ID-1))),
> method=("REML"))
> However, I have not been able to find any documentation on how to correctly
> create the groupedData object. To make my model run, I used
> group=groupedData(y~1|state, data=d, order.groups=FALSE)
> If there is a way to run this model correctly in nlme, I would greatly
> appreciate your insight on this.
Anyone else want to chime in on this ... ?
> You also raised a couple of other issues in your email that I am not sure
> how to address. You stated
> > In general, categorical predictors should not appear in both
> > the fixed and random parts of the model -- that constitutes overfitting.
> >
> > My effect of interest in this model is actually the interaction between
> treatment and state because I expect that the treated individuals but not
> the control will decline in (scrotal) size after (contraceptive) treatment.
> Thus, assuming I am not missing something, state must be in my fixed
> effects. I expect that there will be seasonal differences in size due to
> breeding season. This seasonal difference is interesting but ancillary to
> the main question so perhaps I could take season out of the fixed effects.
I think you're OK: (state|ID) and (season|ID) are putting *interactions*
in the model. What would definitely break would be (1|state) or (1|season)
(i.e. using fixed effects as *grouping variables* for the random effects).
I think the model is fine as it is.
> The other issue you raised was this one:
>
> > Depending on the structure of your data you should also strongly
> > consider including interactions between treatment and your random
> > factors (see Schielzeth, Holger, and Wolfgang
> > Forstmeier. 2009. Conclusions Beyond Support: Overconfident Estimates
> > in Mixed Models. Behavioral Ecology 20 (2):416420.
> > doi:10.1093/beheco/arn145.
> > http://www.ncbi.nlm.nih.gov/pubmed/19461866.). However, you can only
> > include treatment interactions that make sense.
> I asked about this and he said that the interactions with ID were most
> important but this paper does make a good argument for it.
Do you mean "not" above (and "he" is your statistician)? I can
imagine him saying "not" because they are unidentifiable based on your
design -- so it's not that they're not important, just that you can't
do anything about it.
[snip]
I claim you're already including these interactions (as
interactions of *fixed* effects).
[snip]
> It sounds like each
> > individual is allocated to a single treatment, so (guessing that your
> > statistician means that you want the *interactions* of ID with state
> > and season) I think you should probably use
>
> y~trt*season*state*site+(state|ID)+(0+season|ID)
> >
>
> Yes, the interactions of ID with state and season is exactly what I am
> looking for.
> Also, your suggestion of a log transformation did take care of the problem
> of the unequal variances so thankfully, I should not have to deal with that
> in a more complicated way.
> Unfortunately, the model as above but on log transformed data resulted in
> the following error
> In mer_finalize(ans) : singular convergence (7)
> It is unclear to me why this is occurring.
Probably still somewhat overfitted. Try simplifying the model
a bit and see if you can get somewhere (e.g. take out the four-way
interaction and use (trt+season+state+site)^3 instead?
More information about the R-sig-mixed-models
mailing list