[R] unbalanced effects in aov

Brooke LaFlamme bal44 at cornell.edu
Tue Sep 18 23:06:04 CEST 2007


Thank you for the suggestions from Professor Ripley and Steve Elliot. I see now why my data are unbalanced even though I don't have any missing data. 

I think I should use other methods designed for unbalanced data, but does using lme with plate as a random effect also help to fix this problem? I am still very new at this type of analysis. 

Thank you for the help.

Brooke 



-----Original Message-----

> Date: Tue Sep 18 12:38:37 EDT 2007
> From: "Prof Brian Ripley" <ripley at stats.ox.ac.uk>
> Subject: Re: [R] unbalanced effects in aov
> To: 
>
> On Fri, 14 Sep 2007, Brooke LaFlamme wrote:
> 
> > Hi, I have been having some trouble using aov to do an anova, probably 
> > because I'm not understanding how to use this function correctly. For 
> > some reason it always tells me that "Estimated effects may be 
> > unbalanced", though I'm not sure what this means. Is the formula I am 
> > using written incorrectly? Below is the code I am using along with the 
> > data:
> 
> [...]
> 
> > I am treating all the variables as factors (except for response, obviously).
> >
> > formula<-response~species+line%in%species+replicate%in%line+sex%in%species+plate
> > model<-aov(formula, data=my.data)
> >
> > This is the output:
> >
> >> model
> > Call:
> >   aov(formula = formula, data = my.data)
> >
> > Terms:
> >                     species        plate species:line line:replicate
> > Sum of Squares  0.0026469288 0.0000945202 0.0003320255   0.0002008000
> > Deg. of Freedom            2           11           27             10
> >                 species:sex    Residuals
> > Sum of Squares  0.0001383116 0.0006315465
> > Deg. of Freedom            3           66
> >
> > Residual standard error: 0.003093362
> > Estimated effects may be unbalanced
> >
> > Any help would be greatly appreciated as the R help documentation for 
> > aov does not address this issue.
> 
> For the benefit of those who are unable to appreciate 
> fortunes::fortune("WTFM"), the help page actually says
> 
>       'aov' is designed for balanced designs, and the results can be
>       hard to interpret without balance: beware that missing values in
>       the response(s) will likely lose the balance.  If there are two or
>       more error strata, the methods used are statistically inefficient
>       without balance, and it may be better to use 'lme'.
> 
>       Balance can be checked with the 'replications' function.
> 
> So let's do as it suggests:
> 
> > replications(formula, data=my.data)
> $species
> [1] 40
> 
> $plate
> plate
>   1  2  3  4  5  6  7  8  9 10 11 12
> 11 11 11 10  9 11 10 11 10 11  9  6
> 
> $`species:line`
> [1] 4
> 
> $`line:replicate`
> [1] 6
> 
> $`species:sex`
> [1] 20
> 
> and the problem will be clear to those who have read ?replications.
> 
> -- 
> Brian D. Ripley,                  ripley at stats.ox.ac.uk
> Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
> University of Oxford,             Tel:  +44 1865 272861 (self)
> 1 South Parks Road,                     +44 1865 272866 (PA)
> Oxford OX1 3TG, UK                Fax:  +44 1865 272595



More information about the R-help mailing list