[R-sig-ME] random intercept and random slope
Douglas Bates
bates at stat.wisc.edu
Thu Feb 9 21:59:13 CET 2012
2012/2/9 André Barbosa <andre.frainer at emg.umu.se>:
> Dear list members,
> I have read several threads on this list about the use of random variables and its interpretation. I seem to have learned a lot about model fitting, from plotting the raw data and checking the slopes and intercepts, to getting rid of the p-value mindset in which I had had my basic statistics courses at university.
> However, since my statistical courses never covered Mixed Effect Models, I am still unsure if I am doing the right thing or not. Having said that, would you please take a look at the following data and see if my rational is correct? The model is quite simple, I believe.
> I have 8 different plant species, which were mixed two-by-two in all possible combinations. So, each species has 7 pairs + it being alone (monoculture). My response variable is a ratio (observed /expected productivity values, where observed is the productivity of species “a” achieved when mixed with another species, and expected is its value when in monoculture).
> Each species had its own nutrient content analyzed. As I had three nutrient variables measured from each plant, I calculated indices of dissimilarity for each of those pairs.
> My starting model (without specifying random or fixed effects) would be:
> ratio ~ dissimilarity | species
> I expected, based on previous studies, that the relationship between ratio and dissimilarity would yield different slopes for each species, from negative to positive – expectation confirmed by potting a xyplot function of my data. Thus, species should be random. Looking at the xyplot of my data, I also see that the intercepts are somehow variable, ranging between 0.5 and 1.5 (response data points do not extend much further from this range, either). For this reason, I thought on including intercepts as random, as well, which leave me without fixed variables.
> So, I decided to test:
> lmer(ratio ~ 1 + (dissimilarity|species))
> Here follows a subset of my data:
> species pair ratio dissimilarity
> a a+b 1.090935 1.870297012
> a a+c 1.182509 0.691033781
> a a+d 1.505538 1.441237522
> a a+e 1.547295 0.953060747
> a a+f 1.463782 1.306913498
> a a+g 1.197587 1.331087471
> a a+h 1.113263 1.097840225
> b b+a 0.899969 1.870297012
> b b+c 1.102478 1.548604304
> b b+d 1.218110 1.669409077
> b b+e 1.095748 1.536191709
> b b+f 1.306822 1.579788658
> b b+g 1.299480 1.084382658
> b b+h 1.219945 1.137927922
> c c+a 1.092199 1.441237522
> c c+b 1.486702 1.669409077
> c c+d 0.847517 1.688612802
> c c+e 0.210150 0.651183878
> c c+f 1.459219 1.064428069
> c c+g 0.87810 0.590191888
> c c+h 0.91223 1.37455314
> d d+a 1.32486 0.953060747
> d d+b 1.37737 1.536191709
> d d+c 1.23869 1.310607287
> d d+e 1.15714 0.651183878
> d d+f 0.97390 1.22540051
> d d+g 0.92355 0.640371057
> d d+h 0.79097 1.224534999
>
> The output from my model, using the whole data set is:
>> summary(random.model)
> Linear mixed model fit by REML
> Formula: ratio~ 1 + (dissimilarity | species)
> Data: k
> AIC BIC logLik deviance REMLdev
> 11.29 21.42 -0.6465 -3.273 1.293
> Random effects:
> Groups Name Variance Std.Dev. Corr
> species (Intercept) 0.092206 0.30365
> dissimilarity 0.030348 0.17421 -1.000
> Residual 0.048348 0.21988
> Number of obs: 56, groups: species, 8
>
> Fixed effects:
> Estimate Std. Error t value
> (Intercept) 1.18024 0.04087 28.88
>
> ----
>
> My questions:
>
> 1. Is my approach correct in plotting the independent variable “dissimilarity” as random intercept, as in ~ 1 + (dissimilarity|species)?
Probably not. The random effects are defined to have mean 0 so any
non-zero mean must occur in the fixed effects. Generally the model to
be fit is
ratio ~ 1 + dissumilarity + (1 + dissimilarlity|species)
which is equivalent to
ratio ~ dissimilarity + (dissimilarity|specties)
It is a matter of taste whether to include the 1+ or not in a model
formula like this. I find that doing so emphasizes to me where each
of the parameters come from.
> 2. On a publication, can I report the variance component of the random terms (in percentage) as the main result of the test?
> 3. Would it be possible to run McMC on a model that does not have Fixed Effects?
> 4. Would there be any other metrics that I should report as well?
>
> I am sure that my questions are pretty basic, but I would strongly appreciate any input from you. Thank you!
> Andre
>
> _______________________________________________
> R-sig-mixed-models at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
More information about the R-sig-mixed-models
mailing list