[R-sig-eco] Error distribution for fractional response
Bob O'Hara
bohara at senckenberg.de
Thu Jan 30 11:08:00 CET 2014
On 30/01/14 10:58, Adhara Pardo wrote:
> Dear R users,
>
> I would like to fit a GLM to some plant regeneration data (see bottom
> of this e-mail). The dependent variable, an index of regeneration, was
> obtained by dividing the number of saplings by the number of adult
> plants present in each plot. The result is a highly skewed variable, and
> thus specifying, for instance, a Gaussian distribution does not seem to
> be appropriate. Data transformation does not help either. Do you have
> any suggestions on the best distribution to choose?
Rather than use an index, it might be better to use the number of
saplings directly, and assume they are Poisson distributed (or some form
of over-dispersed Poisson). You can use the log of the number of adults
as an offset:
glm(saplings ~ something + offset(log(adults)), family = poisson)
The model is
saplings ~ Poisson(lambda)
log(lambda) = alpha + beta*something + log(adults)
where alpha and beta are the parameters being estimated (lambda is the
expected number of saplings). This model is the same as
lambda = adults*exp(alpha + beta*something)
so it's equivalent to modelling saplings/adults: the adults have just
been moved to the other side of the equation.
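As a rough sketch of the above, here is the offset model fitted to simulated data. Everything in it is made up for illustration: the number of plots, the covariate values, and the "true" alpha and beta are assumptions, not taken from the original data. The quasipoisson fit is only one possible form of over-dispersed Poisson.

```r
# Simulated data; all names and values are illustrative assumptions.
set.seed(1)
n <- 200
adults <- sample(1:30, n, replace = TRUE)  # adult plants per plot
something <- rnorm(n)                      # hypothetical covariate
alpha <- -0.5
beta <- 0.4
lambda <- adults * exp(alpha + beta * something)  # expected saplings
saplings <- rpois(n, lambda)

# Poisson GLM; the offset enters the linear predictor with its
# coefficient fixed at 1, so only alpha and beta are estimated.
fit <- glm(saplings ~ something + offset(log(adults)), family = poisson)
coef(fit)

# One over-dispersed alternative: same mean model, but the dispersion
# is estimated from the data instead of being fixed at 1.
fit_q <- glm(saplings ~ something + offset(log(adults)),
             family = quasipoisson)
summary(fit_q)$dispersion

# Expected saplings per adult at chosen covariate values:
# with adults = 1 the offset is log(1) = 0.
newdat <- data.frame(something = c(-1, 0, 1), adults = 1)
predict(fit, newdata = newdat, type = "response")
```

The coefficient estimates should land close to the simulated alpha and beta, and the quasipoisson fit gives the same coefficients with standard errors scaled by the estimated dispersion.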
Bob
--
Bob O'Hara
Biodiversity and Climate Research Centre
Senckenberganlage 25
D-60325 Frankfurt am Main,
Germany
Tel: +49 69 7542 1863
Mobile: +49 1515 888 5440
WWW: http://www.bik-f.de/root/index.php?page_id=219
Blog: http://blogs.nature.com/boboh
Journal of Negative Results - EEB: www.jnr-eeb.org