[BioC] Help with limma design and contrasts matrices

Gordon K Smyth smyth at wehi.EDU.AU
Sun Oct 24 01:55:08 CEST 2004


This experiment has a deep problem in that there is no replication which replicates the RNA
extraction process.  Your replicates are just aliquots of a single RNA sample or, even worse,
aliquots of a labelled sample.  Hence there is no way to realistically estimate the array to array
variation against which treatment effects can be compared.

By comparison, the batch effect for C is not so important.  The experiment is balanced with
respect to A, B and C and the treatment differences, so any difference between C and the other two
replicates will cancel out.

I think that your only option is to rank genes and forget about format hypothesis testing.  I
guess that if I was analysing this data I would treat the labelled aliquots (A&B) as correlated
blocks and might include a fixed effect for the C vs A,B batch effect.  The block effect would be
included using duplicateCorrelation() with block=c(1,1,2,3,3,4,5,5,6 etc).  However this analysis
will over-state the true level of significance, by an unknown amount, because of the replication
problems already mentioned.  The ranking of the genes will be correct if (i) the measurement error
dominates or (ii) the higher level error components are proportion to the lower.

The section on "Special Designs" in the limma User's Guide at
http://bioinf.wehi.edu.au/limma/usersguide.pdf might be helpful.  You do not make it clear what
documentation you have read or whether you have already tried any possibilities.

Gordon

----------- original message ------------------
Fri Oct 22 06:51:08 CEST 2004
--------------------------------------------------------------------------------

Dear BioC List Members:

I have a data set that I would like to analyze with the limma package.  I am
having trouble figuring out how to make the design and contrasts matrices,
and I was hoping that someone would advise me.  The data set contains the
results from 18 Affymetrix hybridizations.  The table below explains the
experimental design:

  Sample	      RNA aliquot   Labeled sample aliquot	 Dose of Tx1
+/- Tx2
Untreated - A		1	 	   1
0		   -
Untreated - B		1		   2
0		   -
Untreated - C		2		  N/A
0		   -
Tx1 Dose1 - A		1		   1
Dose1	  	   -
Tx1 Dose1 - B		1		   2
Dose1	  	   -
Tx1 Dose1 - C		2		  N/A
Dose1	  	   -
Tx1 Dose2 - A		1		   1
Dose2	  	   -
Tx1 Dose2 - B		1		   2
Dose2	  	   -
Tx1 Dose2 - C		2		  N/A
Dose2	  	   -
Tx2 - A			1		   1
0		   +
Tx2 - B			1		   2
0		   +
Tx2 - C			2		  N/A
0		   +
Tx2 + Tx1 Dose1 - A	1		   1
Dose1	  	   +
Tx2 + Tx1 Dose1 - B	1		   2
Dose1	  	   +
Tx2 + Tx1 Dose1 - C	2		  N/A
Dose1	  	   +
Tx2 + Tx1 Dose2 - A	1		   1
Dose2	  	   +
Tx2 + Tx1 Dose2 - B	1		   2
Dose2	  	   +
Tx2 + Tx1 Dose2 - C	2		  N/A
Dose2	  	   +

Tx = treatment
N/A = not applicable

Note that two types of technical replicates were performed:  samples labeled
with "A" and "B" are replicates at the level of the labeled sample aliquot,
and samples labeled with "C" are replicates that were performed at the level
of the RNA aliquot.  In addition, the "C" replicates were done at a much
later date using a different lot of microarrays, a different lot of
reagents, and a different scanner.  It is not surprising that we are
observing a batch effect in the "C" replicates that is not removed even
after normalization.

Specific questions that I am hoping to have answered:
1. Can I use limma to remove the batch effect that I have observed with the
third replicate?
2. If so, could someone please help me with the creation of the design and
contrasts matrices?  The comparisons that I would like to make are the
following:
     Tx1 Dose1  vs.  Untreated
     Tx1 Dose2  vs.  Untreated
     Tx2  vs.  Untreated
     Tx2 + Tx1 Dose1  vs.  Untreated
     Tx2 + Tx1 Dose2  vs.  Untreated
3. If anyone has any other suggestions for methods other than limma that I
could use to analyze this data set, I would greatly appreciate hearing those
as well.

Thank you,

Jim

_____________________
Jim Breaux, Ph.D.
ViaLogy Corp.
2400 Lincoln Ave.
Altadena, CA 91001
Office: (626) 296-6473
jim.breaux at vialogy.com



More information about the Bioconductor mailing list