[BioC] Limma vs anova (GeneSpring)
Naomi Altman
naomi at stat.psu.edu
Tue Apr 18 16:43:57 CEST 2006
In general, the big differences should come from the genes with very
small variance or very large variance, and moderate differential
expression. This is because the variance estimate is shrunk towards
the mean in Limma (and SAM), whereas ANOVA uses the sample
variance. Since the t or F test uses the variance estimate in the
denominator, genes with very small sample variance will be less
significant with shrinkage, whereas genes with high variance will be
more significant.
--Naomi
At 08:45 PM 4/17/2006, Wettenhall James wrote:
>Hi,
>
>I am now working in a place where GeneSpring GX is the standard
>microarray analysis tool, whereas my only microarray analysis experience
>is using R/BioC, particularly limma. I'm sure this is not an easy
>question, but is there any recommended reading for trying to answer the
>question "Why do I get really different gene lists from the different
>statistical tests?" (In one case where this has arisen, a thorough
>RT-PCR follow-up is being done, so that will be most interesting.)
>
>Here's a GeneSpring tutorial, and a quick discussion of the statistical
>test used (obviously different from limma):
>http://hsc.unm.edu/som/micro/genomics/tutorial.html#OLE_LINK6
>
>1-Way Anova
>Parameteric test, don't assume variances equal
>Multiple testing correction : Benjamini & Hochberg is the default, but
>if this gives no statistically significant genes, then users seem to
>turn this off. One thing I don't understand is that when multiple
>testing correction is turned on, the p-value cutoff entry box is
>_replaced_ by a false discovery rate entry box - why can't I have both?
>(Probably a question for GeneSpring rather the BioC.)
>
>After the ANOVA test, (particularly if there are more than two
>conditions being compared), a post-hoc test (Tukey or
>Student-Newman-Keuls) can be done to determine which pairs of conditions
>the significant genes differ between.
>
>I have been able to get exactly the same results from normalizing /
>probe-level summary between GeneSpring and BioC. (For Affy data,
>GeneSpring has RMA and GCRMA, but it refers to them as "pre-processing".
>"Normalization" is done later - so I suspect that some users
>over-normalize compared with what is done in BioC, not realizing that
>that RMA "pre-processing" includes a quantile normalization.)
>
>So if anyone can recommend any reading for comparing the "different
>worlds" of microarray analysis, I would be most interested.
>
>Best wishes,
>James
>
>
>
> [[alternative HTML version deleted]]
>
>_______________________________________________
>Bioconductor mailing list
>Bioconductor at stat.math.ethz.ch
>https://stat.ethz.ch/mailman/listinfo/bioconductor
>Search the archives:
>http://news.gmane.org/gmane.science.biology.informatics.conductor
Naomi S. Altman 814-865-3791 (voice)
Associate Professor
Dept. of Statistics 814-863-7114 (fax)
Penn State University 814-865-1348 (Statistics)
University Park, PA 16802-2111
More information about the Bioconductor
mailing list