[BioC] normalization and analysis of connected designs
Gordon Smyth
smyth at wehi.edu.au
Thu Jul 3 14:27:55 MEST 2003
Dear Ramon,
I am with you. The direct comparison design you describe is a very sensible
type of design which is intended to compare RNA samples using within-spot
comparisons, i.e., log-ratios or M-values. The limma package in
Bioconductor is specifically designed to analyse experiments of this type.
You're quite correct that you do need a connect design in order to compare
all the RNA types in this way.
Wolfgang is arguing for what in my lab we call a 'single-channel analysis'.
The main proponents of single-channel analysis in the literature are Rus
Wolfinger at SAS and Gary Churchhill at the Jax lab. As far as I am aware
there is no software in Bioconductor designed to do single-channel analysis
of cDNA arrays. We (I mean here Jean, Sandrine who wrote the marray
packages and I) don't yet provide single-channel software because we
consider it to be an experimental methodology whose validity is still be
established. Normalization of single-channel data in particular is
something that we are still trying to do a satisfactory job of. The only
discussion of single-channel normalization for cDNA data that I am aware of
in the literature is Yang and Thorne, see below.
Wolfinger and Churchhill fit mixed linear models in which a spot is a
random effect. One then has multiple error strata corresponding to spots,
to individual channel intensities within spots and perhaps to arrays as
well. There are certainly cases where one can get more information out of
this approach than analysing arrays entirely using log-ratios.
Statistically, the method consists of using random effects to recover
information from the between-spot error strata. The real problem is to know
when it is valid to take this approach and when it is not.
I may have misinterpreted Wolfgang, but he does seem to be proposing a even
more radical approach in which the spot error strata is ignored entirely.
(I think that is only way one could get the calculation that the D-A
variance is reduced by a third.) This is more radical than anything I've
seen in the literature, and I don't personally think it would be a good
approach for cDNA microarray data.
Regards
Gordon
Yang, Y. H., and Thorne, N. P. (2003). Normalization for two-color cDNA
microarray data. In: D. R. Goldstein (ed.), Science and Statistics: A
Festschrift for Terry Speed, IMS Lecture Notes - Monograph Series, Volume
40, pp. 403-418.
At 03:27 AM 3/07/2003, w.huber at dkfz-heidelberg.de wrote:
>Hi Ramon,
>
>What makes the difference between D and A hybridized on the same array,
>and on different arrays? It is (a) the between-array variation (e.g.
>because each time the spotter puts down a drop of DNA it is a a little bit
>different, or because the arrays had different surface treatments, etc.),
>and (b) the between-hybridization variation (e.g. different temperatures,
>different volumes of the reaction chamber). These two sources of variation
>need to be compared to others sources, e.g. (c) between-RNA-extraction,
>(d) between-reverse-transcription, (e) between-labeling, (f) between-dyes.
>(c)-(f) are present no matter whether you D and A are on one array or on
>different ones.
>
>That it is possible to make (a) and (b) small is shown by the fact that
>useful results have been obtained through single-color arrays such as Affy
>or Nylon membranes. Whether in your experiment (a) and (b) are small
>compared to (c)-(f) depends on your particular experiment. If they are,
>you are better of with h_3G - h_1R than with the full chain of summands. I
>have seen examples where this seemed to be the case.
>
>Anyone else?
>
>Best regards
> Wolfgang
>
>
>On Wed, 2 Jul 2003, Ramon Diaz-Uriarte wrote:
>... [SNIP]
> > I am not sure I follow this. I understand that, __if__ D and A had been
> > hybridized in the same array, then the variance of their comparison
> would be
> > a third of the variance of the comparison having to use the (two-step)
> > connectiion between A and D. But I am not sure I see how we can directly do
> > h_3G - h_1R
> > (if this were possible, then, there would be no need to use connected
> > designs.)
> >
> > ... [SNIP] ...
> >
> > So either way, I don't get to see how we can directly do
> > h_3G - h_1R
> >
> > But then, maybe I am missing something obvious again...
>
>_______________________________________________
>Bioconductor mailing list
>Bioconductor at stat.math.ethz.ch
>https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor
More information about the Bioconductor
mailing list