[BioC] Limma and fitFDist()
Gordon Smyth
smyth at wehi.EDU.AU
Thu May 4 04:22:17 CEST 2006
Dear Gerhard,
There have been only two changes to the empirical Bayes calculations
in limma since the early days. These have been in limma versions
2.4.0 and 2.4.13.
The change in 2.4.0 was introduced to better handle the possibility
of residual standard deviations being exactly zero. The residual
standard deviations were offset slightly away from zero. This in turn
was prompted by the fact that rma() can produce a subset of
probe-sets with identical expression values across samples when the
number of samples is small (three or four), as an artifact of the
summarisation method. You can read the discussion from the
Bioconductor mailing list at that time.
Later I became worried that the above change would make limma
slightly more conservative and would change people's historical
results. So in 2.4.13 I wound it back a bit so that limma would give
the same results as it did historically for most data sets, while
still giving some protection against zero residual variances.
The fact that you are seeing different results between 2.4.11 and
2.4.13 suggests that you have a data set with some very small
standard deviations. The more conservative results from 2.4.11
(slightly larger standard deviations but more smoothing) are probably
more reliable for your data. But even better would be to identify
why you have such extremely small standard deviations and seek to avoid them.
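As a rough way to look for such probe-sets, the base R sketch below computes the residual standard deviation for each row of an expression matrix and flags the rows where it is essentially zero. The simulated matrix y, the two-group design and the 1e-8 cutoff are assumptions made purely for illustration and are not part of limma. With limma itself, the corresponding quantity should be available as the sigma component of the fit returned by lmFit().

```r
# Illustrative sketch only: simulated data, hypothetical cutoff.
set.seed(1)

# 200 probe-sets by 6 arrays of simulated log-expression values
y <- matrix(rnorm(200 * 6, mean = 8), nrow = 200, ncol = 6)

# Mimic the artifact described above: a few probe-sets that are
# identical across all samples
y[1:5, ] <- 7

# Two groups of three arrays each
design <- cbind(Intercept = 1, Group = rep(c(0, 1), each = 3))

# Residuals from the least-squares fit, one column per probe-set
res <- qr.resid(qr(design), t(y))

# Residual standard deviation per probe-set
df.residual <- nrow(design) - ncol(design)
sigma <- sqrt(colSums(res^2) / df.residual)

# Flag probe-sets whose residual standard deviation is essentially zero
tiny <- which(sigma < 1e-8)
print(tiny)
summary(sigma)
```

Looking at the flagged rows directly, for example with y[tiny, ], may show whether they are constant across samples.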
Best wishes
Gordon
At 04:30 AM 3/05/2006, Gerhard Thallinger wrote:
>Dear Gordon,
>
> I am using limma regularly to analyze our microarray experiments
>and would like to thank you for providing such an invaluable tool.
>
>A while ago I analyzed an experiment with limma 2.4.11. It consists of
>20 single channel hybridizations of 10 samples (from 2 groups) before
>and after treatment. Limma identified 45 (out of 15000) DE genes
>(with a p-value < 0.01) between the 2 sample groups.
>
>Recently I reanalyzed the experiment after upgrading to limma 2.4.13.
>Using the same script, the number of DE genes dropped to only 15.
>
>Being puzzled, I checked the changeLog() which mentions a change in
>fitFDist(). Replacing fitFDist() with the version from 2.4.11 brings
>back the 45 DE genes from the first analysis.
>
>Now I am wondering which of the results is more "reliable" and whether
>there is a way to compensate for the change in fitFDist() (like
>increasing the p-value threshold, ...).
>
>Any hints are highly appreciated
>
>Gerhard
>
>P.S.: I had the same results with R 2.3.0 and limma 2.6.0 (aka 2.4.15 ??).
>
>------------------------------------------------------------------------
>DI Gerhard Thallinger E-mail: Gerhard.Thallinger at tugraz.at
>Institute for Genomics and Bioinformatics Web: http://genome.tugraz.at
>Graz University of Technology Tel: +43 316 873 5343
>Petersgasse 14/V Fax: +43 316 873 5340
>8010 Graz, Austria Map: http://genome.tugraz.at/Loc.html