[R-sig-ME] lme4 (V.1.1-35.5) Convergence Warnings & Model Reliability
Paul Johnson
p@u|john32 @end|ng |rom gm@||@com
Fri Dec 20 23:48:44 CET 2024
Hi, Manny:
Can you give some details to help us understand this? I'm guessing there
is collinearity in the estimation process, but you need to help us with
some details. If you are allowed to share the full input, let us see.
Almost for sure, there's something unexpected in your data structure.
1 It would help to know how big the sample is and how many individual
observations there are for each person.
I've seen the error you describe when the numbers of observations are very
small in some groups and the presence of cases in these small groups also
coincide with the other predictors in the model.
2 You refer to several variables as confounders, which is a new usage of
that term for me. Lets get some diagnostics. Please fit this with an
ordinary regression and then diagnose the multicollinearity among the
predictors.
3. How about fitting the model with just the one random effect term and
then review the magnitude of the differences in the predictions for the
groups of observations.
I think its also possible that the assumption that the effects are normally
distributed is violated and the calculation process cannot find a way to
tell you that.
PJ
On Tue, Dec 17, 2024 at 3:36 PM Vazquez Sanchez, Manuel via
R-sig-mixed-models <r-sig-mixed-models using r-project.org> wrote:
> Dear R-SIG-Mixed-Models Maling List Recipients,
>
>
> I hope this email finds you well. My name is Manny, and I am a
> biostatistics analyst at NYU Langone Health in New York, NY. I am reaching
> out with a question regarding the functionality of the lme4 (Version
> 1.1-35.5) package, specifically its handling of model convergence in
> longitudinal data analyses.
>
> One of the projects I am collaborating on involves analyzing longitudinal
> data from the Women's Interagency HIV Study (WIHS). Our goal is to explore
> the "syndemic" effects of individual-level exposures—such as depression,
> physical and sexual abuse, and drug use—on health outcomes like sleep
> quality and hypertension. The study cohort included 2,345 participants with
> each participant having 1-36 visits. We are using longitudinal,
> mixed-effects models to determine how the risk of hypertension status
> (binary outcome) varies with different combinations of these exposures
> while accounting for the repeated measurement over time as random effect.
>
>
>
> For this analysis, we are using the glmer() function from the lme4
> package. Example model:
>
>
> glmer(HTN ~ AGEATBL + RACE + EDUCATION + MARRIED + INCOME +
> YearsFromBaseline + DRUGUSE + (1|ID), family = binomial(link = “logit”))
>
>
>
> AGEATBL, RACE, EDUCATION, MARRIED, INCOME, and YearsFromBaseline are
> confounders, and DRUGUSE is one of the syndemic factors. The random effects
> specified (1|ID) are to account for differences between individual
> participants.
>
> In our analyses, we encounter a consistent issue: almost all of the models
> return convergence warnings of the form:
>
>
> Warning message:
>
>
>
> In checkConv(attr(opt, "derivs"), opt$par, ctrl = control$checkConv, :
>
>
>
> Model failed to converge with max|grad| = 0.794027 (tol = 0.002,
> component 1)
>
> To address this, we have tried the following strategies:
>
> * Changing the optimizer and increasing the number of iterations,
> including using the built-in lme4 functions to identify the best optimizer.
> * Reducing the dataset by limiting the number of participants to
> decrease the levels of random effects.
> * Excluding participants with either too few or too many visits to
> regularize the time in study for the included participants.
> *
> Employing confounder variable selection procedures for dimensionality
> reduction.
>
>
> Despite these efforts, the warnings persist. While the models do not
> produce errors, the warnings raise concerns about the reliability of our
> results. We also checked the documentation on lme4 regarding performance
> tips. Setting `control=glmerControl(calc.derivs=FALSE)` eliminates the
> warnings, but we need to report CI's and standard deviations of fixed
> effects parameters for our paper.
>
>
> I have two primary questions:
>
> 1. How reliable are the results when convergence warnings are present?
> * Are the parameter estimates and standard errors likely to be
> biased or misleading?
> 2. If the results are not reliable, are there additional strategies you
> recommend to improve model convergence or performance?
> *
> For instance, would penalized likelihood approaches or modifications to
> the random effects structure be viable options?
>
>
>
> I appreciate the robust functionality of the lme4 package and its critical
> role in facilitating complex modeling. Any guidance you can provide on
> addressing these convergence issues would be immensely helpful for our
> work. Thank you for your time and invaluable contribution to the research
> community. I look forward to any and all insights.
>
>
>
> Best regards,
>
> Manny
>
>
> Manuel R. Vazquez-Sanchez, MS
> Asst Research Scientist
> Department of Population Health
> Division of Biostatistics
> NYU Grossman School of Medicine
> Manuel.VazquezSanchez using nyulangone.org
> (917) 581 2034
>
> NYU Langone Health
> 180 Madison Avenue, 2nd Floor #2-32A
> New York, NY 10016
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> R-sig-mixed-models using r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
>
--
Paul E. Johnson http://pj.freefaculty.org
Director, Center for Research Methods and Data Analysis http://crmda.ku.edu
To write to me directly, please address me at pauljohn at ku.edu.
[[alternative HTML version deleted]]
More information about the R-sig-mixed-models
mailing list