[R-sig-ME] lme for data that is not normally distributed

Ben Bolker bbolker at gmail.com
Wed Aug 3 22:14:46 CEST 2016


   For what it's worth, this graph is assessing
linearity/heteroscedasticity rather than Normality (you would want a Q-Q
plot, not a fitted vs residuals plot, for that).  This doesn't look too
terrible, but there does seem to be a bit of 'flare' at the
large-fitted-value end, which supports Paul's suggestion that you try a
log transformation ...

On 16-08-03 03:58 PM, moses selebatso via R-sig-mixed-models wrote:
> Thank you both Paul and Alain for your help. You both point out that
> I shouldn't test for normality before running a model. I appreciate
> that. Paul I have tried you new scripts and, I guess you were right
> about experience in visually assessing for normality. Not straight
> forward. Below is the plot, for your appreciation. library(lme4) 
> install.packages("devtools") library(devtools) 
> devtools::install_github("pcdjohnson/GLMMmisc") library(GLMMmisc) 
> data<-read.csv("clipboard",sep="\t") m <- lmer(Distance ~ Time + (1 |
> ID), data = data) sim.residplot(m) Regards, Moses SELEBATSO Home:
> (+267) 318 5219 (H)  Mobile:  (+267) 716 39370  or  (+267) 738
> 39370"Those who will ALWAYS agree with you may be oppressed by you"
> 
> On Wednesday, 3 August 2016, 15:54, Paul Johnson
> <paul.johnson at glasgow.ac.uk> wrote:
> 
> 
> 
> Hi Moses,
> 
> I wouldn’t test normality of residuals — better to assess them by
> eye. I know this sounds ad hoc but given that almost no real
> distribution in nature is perfectly normal, the question should be
> “how non-normal can the residuals be before seriously harming my
> inferences?”. This is a more difficult question to answer and
> basically requires experience. A test conflates the degree of
> non-normality and sample size  so a significant result can mean
> “quite normal but high n” while a non-significant result can mean
> “very non-normal but low n”:
> 
> set.seed(1) x <- rpois(1000, 50) hist(x)  # looks beautifully normal 
> shapiro.test(x) # significantly non-normal hist(log(x[1:20])) # looks
> pretty bad shapiro.test(log(x[1:20])) # passes the test
> 
> Given that your distance response measure is (probably) constrained
> to be positive, there’s a good change that it’s right-skewed and
> potentially made more normal by log-transformation (if there are no
> zero distances).
> 
> A good way to visually assess residuals is to plot them against the
> fitted values, then compare these to residuals simulated from the
> fitted model — they should look similar, give or take sampling
> variation. You can do this with a function I recently wrote called
> sim.residplot (available here:
> https://github.com/pcdjohnson/GLMMmisc), although you’ll have to
> refit your model using lmer in the lme4 package:
> 
> library(lme4) library(GLMMmisc) m <- lmer(Distance ~ Time + (1 | ID),
> data = data) sim.residplot(m) # repeat a few times to allow for
> sampling variation
> 
> Good luck, Paul
> 
> 
> 
>> On 3 Aug 2016, at 14:25, moses selebatso via R-sig-mixed-models
>> <r-sig-mixed-models at r-project.org> wrote:
>> 
>> Thank very much for your helpful advice. I ran the model and tested
>> the residuals. They are not normally distributed, and I am still
>> stuck with how I proceed. I tried to copy the output on the email,
>> but I get an error message that the message format cannot sent. 
>> Regards, Moses
>> 
>> On Wednesday, 3 August 2016, 12:15, Highland Statistics Ltd
>> <highstat at highstat.com> wrote:
>> 
>> 
>> 
>> 
>> 
>>> Date: Wed, 3 Aug 2016 09:40:20 +0000 (UTC) From: moses selebatso
>>> <selebatsom at yahoo.co.uk> To: R-sig-mixed-models
>>> <r-sig-mixed-models at r-project.org> Subject: [R-sig-ME] lme for
>>> data that is not normally distributed Message-ID: 
>>> <127496753.15122202.1470217220406.JavaMail.yahoo at mail.yahoo.com> 
>>> Content-Type: text/plain; charset="UTF-8"
>>> 
>>> ?Hello I have some data that I would to analyse with mixed models
>>> (lme). As a standard procedure I tested for the normality of the
>>> data and it is not normal. Any ideas of how deals with this kind
>>> of data? I have a sample below and the model that I was hoping to
>>> use (if?the data?was normal) m <-
>>> lme(Distance~Time,random=~1|ID,data=data).
>> 
>> 
>> Checking normality of the response variable before doing the
>> analysis is a misconception. Why should it be normally distributed?
>> Fit your model and check your residuals for normality.
>> 
>> 
>> Alain
>> 
>>> 
>>> 
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | ID |
>>> 
>>> 
>>> | Time |
>>> 
>>> 
>>> | Distance |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Pre_dry |
>>> 
>>> 
>>> | 4.31287 |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Pre_dry |
>>> 
>>> 
>>> | 6.867578 |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Pre_dry |
>>> 
>>> 
>>> | 4.640427 |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Post_dry |
>>> 
>>> 
>>> | 4.497807 |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Post_dry |
>>> 
>>> 
>>> | 9.726069 |
>>> 
>>> 
>>> |
>>> 
>>> 
>>> | 10187A |
>>> 
>>> 
>>> | Post_dry |
>>> 
>>> 
>>> | 5.150089 |
>>> 
>>> 
>>> 
>>> 
>>> Regards, Moses SELEBATSO? [[alternative HTML version deleted]]
>>> 
>>> 
>>> 
>>> ------------------------------
>>> 
>>> Subject: Digest Footer
>>> 
>>> _______________________________________________ 
>>> R-sig-mixed-models mailing list R-sig-mixed-models at r-project.org 
>>> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
>>> 
>>> ------------------------------
>>> 
>>> End of R-sig-mixed-models Digest, Vol 116, Issue 4 
>>> **************************************************
>>> 
>> 
>> -- Dr. Alain F. Zuur
>> 
>> First author of: 1. Beginner's Guide to GAMM with R (2014). 2.
>> Beginner's Guide to GLM and GLMM with R (2013). 3. Beginner's Guide
>> to GAM with R (2012). 4. Zero Inflated Models and GLMM with R
>> (2012). 5. A Beginner's Guide to R (2009). 6. Mixed effects models
>> and extensions in ecology with R (2009). 7. Analysing Ecological
>> Data (2007).
>> 
>> Highland Statistics Ltd. 9 St Clair Wynd UK - AB41 6DZ Newburgh 
>> Tel:  0044 1358 788177 Email: highstat at highstat.com URL:
>> www.highstat.com
>> 
>> _______________________________________________ 
>> R-sig-mixed-models at r-project.org mailing list 
>> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
>> 
>> 
>> 
>> 
>> [[alternative HTML version deleted]]
>> 
>> _______________________________________________ 
>> R-sig-mixed-models at r-project.org mailing list 
>> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
> 
> 
> 
> 
> 
> 
> 
> 
> _______________________________________________ 
> R-sig-mixed-models at r-project.org mailing list 
> https://stat.ethz.ch/mailman/listinfo/r-sig-mixed-models
>



More information about the R-sig-mixed-models mailing list