[R-sig-ME] Very large HPDintervals, mcmcsamp, lmer

ONKELINX, Thierry Thierry.ONKELINX at inbo.be
Tue Nov 22 09:59:42 CET 2011


Dear Colin,

Your sample size is too small for you model. You model has 7 fixed and 1 random parameter. You could use 10 observations per parameters as a rule of thumb. Hence your model would require about 80 obversations.

This leaves you with two options: a) collect more data b) reduce the model.

Best regards,

Thierry

> -----Oorspronkelijk bericht-----
> Van: r-sig-mixed-models-bounces at r-project.org [mailto:r-sig-mixed-models-
> bounces at r-project.org] Namens Colin Wahl
> Verzonden: maandag 21 november 2011 23:54
> Aan: r-sig-mixed-models at r-project.org
> Onderwerp: [R-sig-ME] Very large HPDintervals, mcmcsamp, lmer
> 
> Hello all,
> I am using lmer to describe patterns in stream variables between streams
> draining watersheds with different land uses. I need to make some kind of
> determination of significance. I understand that mixed models cannot provide
> accurate p values because the denominator df is unknown. As per D. Bates
> recommendation in his post "lmer, p-values and all that" I used mcmcsamp and
> HPDinterval to get 95% "confidence intervals." These intervals were very large
> and suggest that there are no significant differences in variables where
> differences seem obvious. I'll use one of my variables as an example: total
> stream nitrogen. There are 12 streams and 24 sites (two riparian types per
> stream).
> 
> model<-lmer(TN ~ wsh*rip + (1|stream), data=all24)
>    Data: all24
>    AIC   BIC logLik deviance REMLdev
>  78.47 90.25 -29.24    65.15   58.47
> Random effects:
>  Groups   Name        Variance Std.Dev.
>  stream   (Intercept) 4.10443  2.0259
>  Residual             0.20885  0.4570
> Number of obs: 24, groups: stream, 12
> 
> Fixed effects:
>                  Estimate Std. Error t value
> (Intercept)  6.04887    1.19903   5.045
> wshD        -4.65847    1.69569  -2.747
> wshF        -4.85194    1.58617  -3.059
> wshG        -5.18847    1.89584  -2.737
> ripN            -0.18540    0.37314  -0.497
> wshD:ripN    0.15800    0.52770   0.299
> wshF:ripN    0.09825    0.49362   0.199
> wshG:ripN    0.00915    0.58998   0.016
> 
> Estimates are accurate; intercept(cultivated) watersheds clearly have a much
> higher nitrogen concentration than the other watersheds. I use mcmcsamp and
> HPD interval to get 95% CIs:
> 
> model.mcmc<-mcmcsamp(model, n=1000)
> model.HPDi<-HPDinterval(model.mcmc, prob=0.95)
> 
>                       Estimate          Std. Error            t
>    lower                upper
> Cultivated       6.04886663   1.19903192   5.04479200   3.75759240   8.40959042
> Developed        1.39040003   1.69568720  -2.74724406  -4.27036665
> 6.80232811
> Forested         1.19692503   1.58617014  -3.05890364  -4.53524163   6.52712926
> Grassland        0.86039999   1.89583593  -2.73676987  -4.96308190   6.93964588
> Riparia(Cult)    5.86346654   0.37313911  -0.49686587   0.52284572
> 11.44574598
> Riparia(Devel)   1.36300000   0.52769839   0.29941356 -12.15807916
> 14.22417096
> Riparia(Fores)   1.10977501   0.49361665   0.19904123 -11.93267982
> 13.77391850
> Riparia(Grass)   0.68414998   0.58998474   0.01550901 -13.67422625
> 14.50577622
> 
> Considering that the CI are substantially overlapped, I interpret these CIs to
> suggest there is no detectable differences between cultivated streams and other
> streams. This cant be right.
> 
> I have a very elementary understanding of Bayesian statistics, and have not used
> mcmc or HPDinterval before. Is it apparent that I've done something wrong? Is
> my sample size too small? I hope this isnt just a problem with calculating CIs
> from the output, which I'm pretty sure is done the same way its done with lmer
> (see below).
> 
> Please help; I'm desperate to wrap up this analysis.
> 
> Thank you,
> Colin Wahl
> Graduate Student
> Western Wash. University
> Bellingham, WA USA
> 
> 
> 
> CI recalculation:
> 
> I would be confident that CIs for each fixed effect are relative to the intercept,
> but the CIs look wrong so I am questioning myself. Is it true that the
> HPDintervals for each fixed effect are relative to the intercept coefficient, just
> as they are in the initial lmer output?
> 
> i.e...
>                 lower     upper
> (Intercept)  3.684270  8.626003
> wshD        -8.256167 -1.259411
> wshF        -8.469073 -1.852237
> wshG        -9.338573 -1.498553
> ripN        -3.309220  3.105756
> wshD:ripN   -4.412022  4.944243
> wshF:ripN   -4.452947  4.422371
> wshG:ripN   -5.510786  4.917613
> 
> becomes:
> 
>                   lower     upper
> (Intercept)   3.6842697  8.626003
> wshD         -4.5718974  7.366592
> wshF         -4.7848029  6.773766
> wshG         -5.6543034  7.127450
> ripN          0.3750498 11.731759
> ripD        -12.2931391 15.416590
> ripF        -12.5469694 14.301893
> ripG        -14.4743090 15.150819
> 
> Using the following code:
> ci<-as.matrix(model.HPDi$fixef)
> C=ci[1,]
> D=C+ci[2,]
> FF=C+ci[3,]
> G=C+ci[4,]
> CN=C+ci[5,]
> DN=(D)+(ci[5,]+ci[6,])
> FN=(FF)+(ci[5,]+ci[7,])
> GN=(G)+(ci[5,]+ci[8,])
> confint<-c(C,D,FF,G,CN,DN,FN,GN)
> 
> 
> --
> CW
> 
> _______________________________________________
> R-sig-mixed-models at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models




More information about the R-sig-mixed-models mailing list