[R] RDA and trend surface regression

Jari Oksanen jarioksa at sun3.oulu.fi
Tue Feb 27 21:38:15 CET 2007


On 27 Feb 2007, at 20:55, Gavin Simpson wrote:

> On Tue, 2007-02-27 at 13:13 -0500, Kuhn, Max wrote:
>> Helene,
>>
>> My point was only that RDA may fit a quadratic model for the terms
>> specified in your model. The terms that you had specified were already
>> higher order polynomials (some cubic). So a QDA classifier with the
>> model terms that you specified my be a fifth order polynomial in the
>> original data. I don't know the reference you cite or even the
>> subject-matter specifics. I'm just a simple cave man (for you SNL 
>> fans).
>> But I do know that there are more reliable ways to get nonlinear
>> classification boundaries than using x^5.
>
> I doubt that Helene is trying to do a classification - unless you
> consider classification to mean that all rows/samples are in different
> groups (i.e. n samples therefore n groups) - which is how RDA
> (Redundancy Analysis) is used in ecology.
>
> You could take a look at multispati in package ade4 for a different way
> to handle spatial constraints. There is also the principle coordinates
> analysis of neighbour matrices (PCNM) method - not sure this is coded
> anywhere in R yet though. Here are two references that may be useful:
>
Stéphane Dray has R code for finding PCNM matrices. Google for his 
name: it's not that common. I also have a copy of his function and can 
send it if really needed, but it may be better to check Dray's page 
first. Stéphane Dray says think that not all functions need be in CRAN. 
May be true, but I think it might help many people.

There are at least three reasons why not use polynomial constraints in 
RDA. Max Kuhn mentioned one: polynomials typically flip wildly at 
margins (or they are unstable in more neutral speech). Second reason is 
that they are almost impossible to interpret in ordination display. The 
third reason is that RDA (or CCA) avoid some ordination artefacts 
(curvature, horseshoe, arc etc.) just because the constraints are 
linear: allowing them to be curved allows curved solutions. These 
arguments are not necessarily valid if you only want to have variance 
partitioning, or if you use polynomial conditions ("partial out" 
polynomial effects in Canoco language). In that case it may make sense 
to use quadratic (or polynomial) constraints or conditions.

cheers, Jari Oksanen



More information about the R-help mailing list