[R-sig-eco] data normality
Bob O'Hara
bohara at senckenberg.de
Sun May 6 17:00:17 CEST 2012
On 05/06/2012 10:54 AM, Yong Shen wrote:
> Dear all,
> I have two questions about data normality.
> I used stepwise multiple regression to determine which variables contributed to tree growth, and want to built a model to explain tree growth. Sample size is about 50 tree species, I think it is not a large sample size, and some variables are not normal distribution.
> 1. Do I have to transform them to normal distributions before I perform multiple regression?
No. The only area where a Normal assumption comes in is that the
residuals are normally distributed. So you can happily fit the model
without worrying about normality until after you've got the model.
> 2. Two variables can not transform to normal distributions although I used some methods (e.g log, sqrt, boxcoxfit), what should I do for the two variables?
Leave them as they are.
Advice that makes life simpler - always the best sort.
Bob
--
Bob O'Hara
Biodiversity and Climate Research Centre
Senckenberganlage 25
D-60325 Frankfurt am Main,
Germany
Tel: +49 69 798 40226
Mobile: +49 1515 888 5440
WWW: http://www.bik-f.de/root/index.php?page_id=219
Blog: http://blogs.nature.com/boboh
Journal of Negative Results - EEB: www.jnr-eeb.org
More information about the R-sig-ecology
mailing list