[R-sig-ME] Orthogonal Polynomial contrasts with ordered factors

Steve Denham stevedrd at yahoo.com
Thu Dec 17 12:28:34 CET 2015


My real fear about fourth order polynomial by categorical interaction is that you may just be fitting noise.  The part about the plateau is what tipped me off to looking at a non-linear effect.  If only one interaction showed this, it would seem to me that the others aren't showing the kind of biological process observed here.
Have you looked at the residual error with increasing order of the polynomials?  If there is an inflation of this (or the other random effects), then it is likely an overfit equation. Steve Denham Director, Biostatistics MPI Research, Inc.
 

 
      From: Quentin Schorpp <quentin.schorpp at ti.bund.de>
 To: Steve Denham <stevedrd at yahoo.com>; "r-sig-mixed-models at r-project.org" <r-sig-mixed-models at r-project.org> 
 Sent: Thursday, December 17, 2015 3:08 AM
 Subject: Re: [R-sig-ME] Orthogonal Polynomial contrasts with ordered factors
   
 Hello,
 
 Thank you for your answer. If I understood you right, you mean significant effects  of higher order polynomials could point to non-linear relationships between response and explanatory variable (also if the explanator is a categorial factor), under the  condition, that  lower order polynomials in between are "skipped" in the sense that theses are not significant? The significant effect of ("non successive") higher order polynomials could be either in interaction terms and without? However when thinking about fitting a non-linear model, i'm wondering if it would be justified in the case that significant ("non successive") higher order polynomials only occur in one interaction case (the discussed pattern is only observed in the interaction with one level of the second interaction factor). 
 
 p.S. Sorry for the worse formatting of the initial question, while writing the e-mail everything looked good, especially the factor by factor table to reveal the partly nested structure.
 
 kind regards,
 Quentin
 
 
 Am 16.12.2015 um 12:38 schrieb Steve Denham:
  
  I'll take a crack at the B questions.  Keep in mind that this is an opinion only.  A significant fourth order by other factor interaction, especially when the lower order polynomial by factor interactions are not significant means that you have probably fit a polynomial to a non-polynomial effect.  Your mention of a plateau tends to support this, at least to me.  Some sort of non-linear effect (sigmoidal, four or five factor logistic) may make more sense from a biological perspective.   Steve Denham Director, Biostatistics MPI Research, Inc. 
 
      From: Quentin Schorpp <quentin.schorpp at ti.bund.de>
 To: "r-sig-mixed-models at r-project.org" <r-sig-mixed-models at r-project.org> 
 Sent: Tuesday, December 15, 2015 8:14 AM
 Subject: [R-sig-ME] Orthogonal Polynomial contrasts with ordered factors
  
 Hello,
 
 I would appreciate to get to know more about the use of polynomial 
 contrasts in lme4::glmer.
 Does anybody could give me an advice for literature about that subject.
 
 In particular
 A:  I read, that if a second order polynomial is significant in the 
 summary output, then it is supposed to be significant AFTER the first 
 order polynomial was                taken into account. Is that right?
 
 B1 : What happens if i use an ordered factor with another factor 
 (ordered or not) in an itneraction term? What does a signficant 
 interaction of the second                factor (any level) with the 
 fourth power polynomial of the first ordered factor tell me?
 B2: And waht does it tell me when the lower order polynomials are not 
 significant in the interaction?
 
 
 For more interested readers:
 
 _The data_
 My data is abundances of earthworms. I sampled 15 fields, three times 
 (samcam) during two years, with 4 pseudoreplicates per field (N=180).
 The factor age_class describes the stage of development of the field, it 
 has 5 levels ((n = 3 replicates).
 However, one of these levels A_Cm has n=6 since i had to switch the 
 fields in the second year.
 Field.ID is my random factor, to control for the pseudoreplication per 
 field and the longitudinal character of the data.  For the sake of less 
 complexity samcam stayed non-ordered.
 
 Here is the design
 
   field.ID\samcam1 2 3 1 4 4 4 2 4 4 4 3 4 4 4 4 4 4 4 5 4 4 4 6 4 4 4 7 4 
 4 4 8 4 4 4 9 4 4 4 10 4 4 4 11 4 4 4 _12 4 4 4_ 13 4 0 0 Fields had to 
 be switched in the second year 14 4 0 0 15 4 0 0 16 0 4 4 17 0 4 4 18 0 4 4
 
 
 
 Other continuous predictor variables were scaled before analysis.
 
 data structure:
   $ abundance      : num  0 0 3 3 2 1 2 5 12 5 ...
 
   $ ID            : Factor w/ 180 levels "1","2","3","4",..: 1 2 3 4 5 
 6 7 8 9 10 ...
   $ field.ID      : Factor w/ 18 levels "1","2","3","4",..: 1 1 1 1 2 2 
 2 2 3 3 ...
   $ age_class      : Ord.factor w/ 5 levels "A_Cm"<"B_Sp_young"<..: 5 5 
 5 5 5 5 5 5 5 5 ...
   $ samcam        : Factor w/ 3 levels "1","2","3": 1 1 1 1 1 1 1 1 1 1 ...
   $ hole          : Factor w/ 4 levels "1","2","3","4": 1 2 3 4 1 2 3 4 
 1 2 ...
 
   $ scl.pH        : num  -1.553 -1.553 -1.553 -1.553 0.715 ...
   $ scl.mc        : num  -1.072 -1.072 -1.072 -1.072 -0.429 ...
   $ scl.cn        : num  -0.703 -0.703 -0.703 -0.703 -0.474 ...
   $ scl.sand      : num  -0.245 -0.245 -0.245 -0.245 -0.0127 ...
   $ scl.silt      : num  -0.897 -0.897 -0.897 -0.897 -1.529 ...
   $ scl.clay      : num  1.19 1.19 1.19 1.19 1.66 ...
   $ scl.ata1      : num  1.6471 1.6471 1.6471 1.6471 0.0894 ...
   $ scl.atb1      : num  1.6658 1.6658 1.6658 1.6658 0.0659 ...
   $ scl.hum1      : num  -1.378 -1.378 -1.378 -1.378 0.429 ...
 
 _my hyptheses are_
 1. abundance increases with increasing age_class
 2. If abundance increases over the age classes it will be observed by 
 increasing abundance during the period of sampling
 (3. Abundance increases during the period of sampling)
 
 _The Model was :_
 best.mod <- glmer(abundance~ age_class*samcam + scl.prec1 + 
 scl.mc*scl.pH  + (1|field.ID)  ,data=data,family=poisson, 
 control=glmerControl(optimizer="bobyqa"))
 
 here is The Model output of one of the best models revealed by 
 MuMIn::dredge:
 
 Generalized linear mixed model fit by maximum likelihood (Laplace Approximation) ['glmerMod']
   Family: poisson  ( log )
 Formula: anc ~ age_class * samcam + I(scl.ats1^2) + scl.prec1 + (1 | field.ID)
     Data: data
 Control: glmerControl(optimizer = "bobyqa")
 
       AIC      BIC  logLik deviance df.resid
       731      789    -348      695      162
 
 Scaled residuals:
     Min    1Q Median    3Q    Max
 -2.181 -0.696 -0.171  0.644  4.178
 
 Random effects:
   Groups  Name        Variance Std.Dev.
   field.ID (Intercept) 0.263    0.513
 Number of obs: 180, groups:  field.ID, 18
 
 Fixed effects:
                     Estimate Std. Error z value Pr(>|z|)
 (Intercept)          0.6457    0.2079    3.11  0.0019 **
 age_class.L          1.9924    0.4416    4.51  6.4e-06 ***
 age_class.Q          -0.8644    0.4204  -2.06  0.0398 *
 age_class.C          -0.2373    0.4007  -0.59  0.5537
 age_class^4          0.7026    0.3591    1.96  0.0504 .
 samcam2              0.8549    0.2074    4.12  3.7e-05 ***
 samcam3              0.3074    0.1852    1.66  0.0969 .
 I(scl.ats1^2)        -0.3328    0.1038  -3.21  0.0013 **
 scl.prec1            0.2241    0.0851    2.63  0.0085 **
 age_class.L:samcam2  -0.3474    0.5463  -0.64  0.5248
 age_class.Q:samcam2  0.1074    0.4766    0.23  0.8216
 age_class.C:samcam2  0.8910    0.3644    2.44  0.0145 *
 _age_class^4:samcam2 -1.0352 0.2601 -3.98 6.9e-05 *** _  # HERE is the significant interaction of interest!
 age_class.L:samcam3  0.1274    0.5125    0.25  0.8038
 age_class.Q:samcam3  -0.1801    0.4429  -0.41  0.6842
 age_class.C:samcam3  0.5659    0.3615    1.57  0.1174
 _age_class^4:samcam3 -0.6489 0.2617 -2.48 0.0131 * ___# HERE is the significant interaction of interest!
 
 
 _Interpretation:_
 I understood, that the relationship was linear in general, as indicated 
 by the second line of the output, and this did not change between the 
 sampling campaigns. However, during the second and third sampling 
 campaign the relationship of abundances in the age_classes was 
 characterised by a stronger slope in younger classes and reached a 
 plateau afterwards, as indicated by the fourth power.
 The missing of the interaction between age_class and samcam1 is very 
 hard for me to understand
 
 I'm thankful for any advices!
 
 Quentin
 
 
 -- 
 Quentin Schorpp, M.Sc.
 Thünen-Institut für Biodiversität
 Bundesallee 50
 38116 Braunschweig (Germany)
 
 Tel:  +49 531 596-2524
 Fax:  +49 531 596-2599
 Mail: quentin.schorpp at ti.bund.de
 Web:  http://www.ti.bund.de
 
 Das Johann Heinrich von Thünen-Institut, Bundesforschungsinstitut für Ländliche Räume, Wald und Fischerei – kurz: Thünen-Institut –
 besteht aus 15 Fachinstituten, die in den Bereichen Ökonomie, Ökologie und Technologie forschen und die Politik beraten.
 
 Quentin Schorpp, M.Sc.
 Thünen Institute of Biodiversity
 Bundesallee 50
 38116 Braunschweig (Germany)
 
 Tel:  +49 531 596-2524
 Fax:  +49 531 596-2599
 Mail: quentin.schorpp at ti.bund.de
 Web:  http://www.ti.bund.de
 
 The Johann Heinrich von Thünen Institute, Federal Research Institute for Rural Areas, Forestry and Fisheries – Thünen Institute in brief –
 consists of 15 specialized institutes that carry out research and provide policy advice in the fields of economy, ecology and technology.
 
 
     [[alternative HTML version deleted]]
 
 _______________________________________________
 R-sig-mixed-models at r-project.org mailing list
 https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models

    
  

 -- 
Quentin Schorpp, M.Sc.
Thünen-Institut für Biodiversität
Bundesallee 50
38116 Braunschweig (Germany)

Tel:  +49 531 596-2524
Fax:  +49 531 596-2599
Mail: quentin.schorpp at ti.bund.de
Web:  http://www.ti.bund.de

Das Johann Heinrich von Thünen-Institut, Bundesforschungsinstitut für Ländliche Räume, Wald und Fischerei – kurz: Thünen-Institut – 
besteht aus 15 Fachinstituten, die in den Bereichen Ökonomie, Ökologie und Technologie forschen und die Politik beraten.

Quentin Schorpp, M.Sc.
Thünen Institute of Biodiversity
Bundesallee 50
38116 Braunschweig (Germany)

Tel:  +49 531 596-2524
Fax:  +49 531 596-2599
Mail: quentin.schorpp at ti.bund.de
Web:  http://www.ti.bund.de

The Johann Heinrich von Thünen Institute, Federal Research Institute for Rural Areas, Forestry and Fisheries – Thünen Institute in brief – 
consists of 15 specialized institutes that carry out research and provide policy advice in the fields of economy, ecology and technology.


   

	[[alternative HTML version deleted]]



More information about the R-sig-mixed-models mailing list