[R-sig-eco] twinspan classification rules as narrative

Jari Oksanen j@r|@ok@@nen @end|ng |rom ou|u@||
Mon Dec 16 16:39:02 CET 2019


Howdy,

TWINSPAN is not in CRAN. It seems that you found it in github. 

TWINSPAN is an old method, and it seems that people are forgetting how it works. Here some narrative:

First, you have defined cut levels to transform your abundance data into binary indicator “pseudospecies”. You give these cut levels in your call. Each species is split by these cut levels into pseudospecies, and that cut level number is added to the name of species. In your example, the indicator pseudospecies at division 8 are actually Cladgray1 and Cladnigr1 where the added ‘1’ just means that the species just occurs, but can have any abundance value: there is no way of knowing its abundance except for the lower limit (>0). In division 1 you have, for instance, Cladmiti4 which means that the species occurs at least at the cutlevel 4: at least at quantity 5, but it can have any value above that limit.

Now to the narrative for the division. The rule for division 8 (that you mention in your post) is actually "+Cladnigr1 +Cladgray1 < 1”. So they both are at the lowest cut level 1 (present with any abundance), the ‘+’ sign means that they are both positive indicator values and you add +1 for every plot where they occur. Would the sign be ‘-‘, you would add -1 for each presence to give negative scores. Doing this for all species gives you the indicator score: if both species are present, your score is 2, if one is present, your scores is 1 and if neither is present your score is 0. The condition is ‘< 1’ meaning that if neither is present (score 0), the condition is true and you go to final group 16, but if one or both are present (scores 1 or 2), the  condition is false and you continue to division 17. However, this is a tree, and this narrative rule only applies to division 8 and those 16 sampling units it contains: these are split by this rule. To get to this division with this rule you must have satisfied the previous rules leading to this branch. You may see the branch structure using plot(twb): the internal divisions are shown in squared on tree, and the final groups and their sizes as terminal leaves.

The classification rules give you only the lower limit of species, and depending on the indicator score threshold, even some of these indicators may be missing in plot. However, you can use function twintable to see the actual cutlevels for each species. These serve as a cover-class values, but do not give any more detail than the cutlevels you defined.

Cheers, Jari

> On 16 Dec 2019, at 16:44, Gonzalez-Mirelis, Genoveva <genoveva.gonzalez-mirelis using hi.no> wrote:
> 
> Dear all,
> 
> I am trying to understand the results from the twinspan function in the R package that has been recently developed (also named twinspan).
> 
> Particularly, I would like to be able to derive the classification rules (indicator species and abundance values, or rather ranges) for each terminal group of the twinspan classification.
> 
> From this:
> 
> library(twinspan)
> data(ahti)
> twb <- twinspan(ahti, cutlevels = c(0, 0.1, 1, 5, 25, 50, 75))
> summary(twb)
> 
> I understand that say, for group number 16 (the first terminal group encountered) the indicator species were Cladnigr and Cladgray.
> 
> I also understand that the indicator score threshold tells me which path to follow down the tree (left or right). But I struggle to understand just what the indicator score means (1)? And whether it can be related to the original abundance value for those two species at the three relevant sites, namely Ster113, Ster097 and Ster098?
> 
> What would be a narrative way to describe this particular branch of the tree?
> 
> Many thanks in advance,
> 
> Genoveva
> 
> Genoveva Gonzalez Mirelis, Scientist
> Institute of Marine Research
> Nordnesgaten 50
> 5005 Bergen, Norway
> Phone number +47 55238510
> 
> 
> 	[[alternative HTML version deleted]]
> 
> _______________________________________________
> R-sig-ecology mailing list
> R-sig-ecology using r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-ecology



More information about the R-sig-ecology mailing list