[BioC] Correcting for batch effects in limma
James W. MacDonald
jmacdon at uw.edu
Tue May 15 15:16:50 CEST 2012
Hi Khadeeja,
On 5/15/2012 6:06 AM, khadeeja ismail wrote:
> Hi,
>
> Is it possible to correct for batch effects in limma when doing a paired analysis? I have pairs from two
> runs Batch 1 and Batch 2. There are no pairs where one is in Batch1 and the
> other in Batch 2. If I enter the Batch no. into the design matrix,
> no coefficients are generated as there is no difference between run no.
> for any pair.
Without seeing your code it is hard to say much. In addition, it isn't
really clear what you mean by 'no coefficients are generated'. However,
you should note that accounting for the pairing structure and the
batches will to a certain extent be doing the same thing. As an example,
let's consider a single pair from one batch.
If we were to consider the conventional approach for paired data, you
would first compute pair1_treated - pair1_control, and then using these
differences to compute statistics. By computing the paired differences,
we have subtracted out any sample-specific variability, which includes a
batch effect (e.g., if batch 1 has higher overall expression due to some
technical reasons, you would expect both of the pairs to reflect this
higher expression, and subtracting the two would thus eliminate the
batch effect, modulo variability).
When you fit a batch effect, you are in essence computing a mean
expression value for all samples in a particular batch and then
subtracting that from each sample. This is very similar to what you have
done by pairing. Doing both is not likely IMO to add benefit, and is
just wasting degrees of freedom.
Best,
Jim
>
> Any advice on this would be most appreciated.
>
> Thanks,
> Khadeeja
>
> [[alternative HTML version deleted]]
>
>
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list