[R-sig-ME] mcmcglmm Priors: Auto correlation and extreme post mean values

Fri May 15 15:28:19 CEST 2020

Good day List,
My name is Alex, I am currently using the package mcmcglmm to determine the
impact of lunar cycles, human presence and land use type (agricultural vs
protected) on the trapping success of meso-predators for my PhD. I am new
to MCMCglmm and I was wondering if you could assist with my problem.

Structure of data: I am testing if and how the change in lunar cycle
(factor) and human presence ( factor) impact trapping success. Trapping
success is count data but because we only trapped 1 individual at most, the
data can also be considered binary. Human presence (HP) is split into 5
categories.

Model: I am running Human presence, Moon phase and Land use type as fixed
effects and the site name as a random effect.

Problem: I have quantified human presence in various locations, with areas
exposed to intense human pressures, there is a complete absence of any
successful trappings (HP2 and HP4 have no trapping success resulting in
only zeros). Post mcmcglmm, these parameters display auto correlation in
their graphs (if i set them as fixed or random effects) as well as when
using the geweke and gelman tests, but almost perfect mixing for the
others, especially when I run it with 5-20 million iterations. I also see
poorly mixed VCV graphs if the degree of belief is less than 20. I have
used a variety of priors, see below, with no success. I have also increased
the amount of itterations to 20 million, decreased the thinning factor,
increased the burnin and used poisson, zapoisson, zipoisson, ordinal and
threshold distributions, with threshold and ordinal having the best DIC
values. Together with this, I am also seeing extreme post mean values for
the models that are displaying the lowest DIC values, which is not
something that I see in any of the literature (see below).

I have read Jarrod Hadfields awesome course and tutorial notes, as well as
other literature in my field, online tutorials, information documents and
the vignettes and help functions in R as well as the correspondence on this
email list between Jarrod Hadfield and other users but I cannot seem to
figure out why the above is happening. My data set is quite small (176
entries in total, with about 40 entries per HP category) and according to
what I have read, priors can have a large impact on small to moderately
sized data sets.

Questions:
1. Does the problem lie with my priors? And if so, do you have any tips on
how to solve it?
2. Should I be using ordinal or threshold family? I have read that the
mixing of zi and za poisson models can be poor, and from my tries with
them, this seems to be the case.
3. Are either the extreme post mean values or the auto correlation in
certain parameters avoidable in this case?

*Examples of priors*
prior1<-list(R=list(V=diag(1)*1e-8, nu=0.2),G=list(G1=list(V=diag(1)*1e-6,
nu=0.2))
prior2<-list(R=list(V=diag(1), nu=0.2),G=list(G1=list(V=diag(1), nu=2)))
prior3<-list(R=list(V=diag(2), nu=0, fix=2), G=list(G1=G1))
G1=list(V=diag(2), nu=2, alpha.mu=c(0,0), alpha.V=diag(2)*1000)
prior4:
A note: I have these with degree belief ranging from 0.0002 to 20, with the
best mixing (apart from the auto correlation in certain parameters) with nu
set at 20 for the R structure. If i set the G structures degree of belief
to less than 20, it mixes the random effect extremely poorly. I have also
used ranging values from 1e-1 to 1e-20 and fixed the variances at 1 and 2
using various different priors.
*Examples of model scripts:*
OM1.1<-MCMCglmm(Jackal_trapped ~ Human_presence +  MP + Landuse, random
=~Site, data = data, prior = prior1,
family = "ordinal", nitt = 5000000, thin = 500, burnin = 500000, verbose =
FALSE, pl = TRUE, DIC=TRUE)
model1.1<-MCMCglmm(Jackal_trapped~trait,random=~us(trait):Human_presence +
us(trait):Landuse,family="zapoisson",rcov=~idh(trait):units,data=data,prior=prior,nitt=500000,thin=500,burnin=200000,verbose=FALSE)

model1.2<-MCMCglmm(Jackal_trapped~trait,random=~idh(trait):Human_presence +
idh(trait):Landuse,family="zapoisson",rcov=~idh(trait):units,data=data,prior=prior,nitt=500000,thin=500,burnin=200000,verbose=FALSE)

*Example of large post mean summary*
 family: I have used zapoisson, ordinal and zipoisson distributions
prior1<-list(R=list(V=diag(1)*1e-4, nu=0.2),G=list(G1=list(V=diag(1)*1e-2,
nu=2)))
OM1.1<-MCMCglmm(Jackal_trapped ~ Human_presence +  MP + Landuse, random
=~Site, data = data, prior = prior1, family = "family", nitt = 5000000,
thin = 500, burnin = 500000, verbose = FALSE, pl = TRUE, DIC=TRUE)
 Iterations = 500001:4999501
 Thinning interval  = 500
 Sample size  = 9000

 DIC: 0.003093406

 G-structure:  ~Site

     post.mean l-95% CI u-95% CI eff.samp
Site    0.1943 0.001154   0.1913     9000

 R-structure:  ~units

      post.mean l-95% CI u-95% CI eff.samp
units 1.755e+10 1.72e+09 3.97e+10     3632

 Location effects: Jackal_trapped ~ Human_presence + MP + Landuse
                              post.mean    l-95% CI  u-95% CI eff.samp
 pMCMC
(Intercept)                  -107789  -229797     9915     4666
0.0498 *
Human_presence1     49459   -12241      117809     6957         0.1002
Human_presence2    -49280  -153682      46279     8478          0.3184
Human_presence3     18642   -66603      104880     8529          0.6684
Human_presence4   -214029  -340008     -89259     6098          <1e-04 ***
MPFQ                         -44627  -119345      24385       7328
0.1856
MPNM                          12362   -52404      72882       9000
0.6691
MPTQ                           -43907  -122103    22055     8357
 0.1942
Landusereserve            -51105  -138603    35033     9000
 0.2302

*Example of model outputs with auto correlation*
family: I have used zapoisson, ordinal and zipoisson distributions
prior2<-list(R=list(V=diag(1)*1e-6, nu=20),G=list(G1=list(V=diag(1)*1e-6,
nu=20)))
OM2.1<-MCMCglmm(Jackal_trapped ~ Human_presence +  MP + Landuse, random
=~Site, data = data, prior = prior2,
family = "family", nitt = 5000000, thin = 500, burnin = 500000, verbose =
FALSE, pl = TRUE, DIC=TRUE)
 Iterations = 500001:4999501
 Thinning interval  = 500
 Sample size  = 9000

 DIC: 182.6921

 G-structure:  ~Site
     post.mean  l-95% CI  u-95% CI eff.samp
Site 1.114e-06 5.171e-07 1.911e-06     8515

 R-structure:  ~units

      post.mean  l-95% CI u-95% CI eff.samp
units 1.106e-06 5.129e-07 1.87e-06     9000

 Location effects: Jackal_trapped ~ Human_presence + MP + Landuse

                             post.mean l-95% CI u-95% CI eff.samp   pMCMC

(Intercept)               -0.70349 -1.20259 -0.28723    2.027     < 1e-04
***
Human_presence1   0.53752  0.41033  0.67240    6.030    < 1e-04 ***
Human_presence2  -0.44909 -0.65858 -0.03042    2.600    0.00289 **
Human_presence3  -0.11236 -0.48808  0.30079    1.683     0.53556
Human_presence4  -1.38806 -1.74144 -0.93892    2.829    < 1e-04 ***
MPFQ                        -0.38918 -0.57513 -0.23967    2.300    < 1e-04
***
MPNM                         0.14273 -0.05171  0.28830    1.625    0.14356

MPTQ                         -0.25338 -0.52899 -0.03162    1.477    0.01222
*
Landusereserve          -0.64469 -1.06522 -0.18880    2.239    < 1e-04 ***

*Example: Using my variables as random effects*
*family: I have used zapoisson, ordinal and zipoisson distributions*
model1.1<-MCMCglmm(Jackal_trapped~trait,random=~us(trait):Human_presence +
us(trait):Landuse,family="family",rcov=~idh(trait):units,data=data,prior=prior,nitt=500000,thin=500,burnin=200000,verbose=FALSE)
Iterations = 200001:499501
Thinning interval  = 500
Sample size  = 600

 DIC: 175.5933

 G-structure:  ~us(trait):Human_presence

        post.mean   l-95% CI  u-95% CI eff.samp
traitJackal_trapped:traitJackal_trapped.Human_presence 1.179e-06  5.584e-07
2.051e-06    472.7
traitza_Jackal_trapped:traitJackal_trapped.Human_presence    -1.243e-08
-5.964e-07 5.494e-07    600.0
traitJackal_trapped:traitza_Jackal_trapped.Human_presence    -1.243e-08
-5.964e-07 5.494e-07    600.0
traitza_Jackal_trapped:traitza_Jackal_trapped.Human_presence  1.172e-06
 5.252e-07 1.883e-06    507.3

 ~us(trait):Landuse

      post.mean   l-95% CI  u-95% CI eff.samp
traitJackal_trapped:traitJackal_trapped.Landuse       1.143e-06  4.876e-07
2.029e-06      600
traitza_Jackal_trapped:traitJackal_trapped.Landuse    7.237e-09 -5.758e-07
4.980e-07      600
traitJackal_trapped:traitza_Jackal_trapped.Landuse    7.237e-09 -5.758e-07
4.980e-07      600
traitza_Jackal_trapped:traitza_Jackal_trapped.Landuse 1.157e-06  4.896e-07
2.022e-06      600

 R-structure:  ~idh(trait):units

                             post.mean  l-95% CI  u-95% CI eff.samp
traitJackal_trapped.units    9.855e-07 4.689e-07 1.685e-06    704.6
traitza_Jackal_trapped.units 1.017e-06 4.490e-07 1.625e-06    600.0

 Location effects: Jackal_trapped ~ trait

                       post.mean l-95% CI u-95% CI eff.samp  pMCMC
(Intercept)               0.7789   0.7730   0.7875    8.419 <0.002 **
traitza_Jackal_trapped   -2.8911  -2.9052  -2.8741    6.982 <0.002 **

I hope I have shared enough info regarding my problem and that it makes
sense. I hope my post meets the requirements of the list and that it does
not seem like I am making my problem somebody elses, I am honestly just
lost at the moment.
I welcome any constructive criticism and any other help you can provide.

Thank you for your help.
I look forward to your responses.

With kind regards,

Alexander Edward Botha
alexbotha555 using gmail.com 082 414 9030
PhD candidate
Mammal ecology

	[[alternative HTML version deleted]]