[BioC] grouping samples from distinct runs in limma

James W. MacDonald jmacdon at med.umich.edu
Thu Mar 2 20:45:01 CET 2006


Hi Edmund,

Edmund Chang wrote:
> Dear list,
> I am working with Affymetrix microarrays and looking for differential 
> gene expression using limma.  We repeated the same microarray experiment 
> two separate times (about a year apart), because *some* of the 
> microarrays came out bad.  So instead of running the same samples again 
> (technical replicates), we generated an entirely new set of RNA.
> (i.e. for treatment A:  1st run- A1, A2, A3;  2nd run- a1, a2, a3)
> (and for treatment B: 1st run- B1,B2,B3; 2nd run- b1,b2,b3)
> 
> I was wondering how to handle this sort of data in limma.  Because they 
> are biologically distinct, I am tempted to group them together (e.g. 
> combining samples A1,A2,A3 and a1, a2, a3 into A1-A6).  But I have 
> reservations about grouping arrays which were run one year apart from 
> each other.

You almost certainly will need to add an effect to your model to account 
for the two batches of chips. A very good example can be found here:

http://bioinf.wehi.edu.au/marray/jsm2005/lab5/lab5.html

Note that they are fitting an effect to account for the day that the 
chips were processed which would be analogous to what you need to do.

An alternative would be to fit a mixed model by using the block argument 
to lmFit().


> 
> How can I prove to myself that samples which were identically treated 
> but came from separate experiments can be grouped together in limma? Do 
> I calculate the correlation coefficient between groups 
> (corfit$consensus) and use that to determine if the groups are closely 
> correlated?

A relatively easy way to check this would be to do a principal 
components analysis and plot the first two PCs (you can use plotPCA() in 
affycoretools to do so). If the separate batches are grouping together, 
this gives you some assurance that there isn't a batch effect. 
Personally, I would err on the side of caution and assume that you need 
to do something to account for differences between the batches.

HTH,

Jim


> 
> or is there another function in limma that can handle this scenario?
> 
> Any input will be highly appreciated!
> Edmund Chang
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor


-- 
James W. MacDonald, M.S.
Biostatistician
Affymetrix and cDNA Microarray Core
University of Michigan Cancer Center
1500 E. Medical Center Drive
7410 CCGC
Ann Arbor MI 48109
734-647-5623



More information about the Bioconductor mailing list