[BioC] Help interpreting many contrasts in one contrast versus many individual contrast matrices
Gordon K Smyth
smyth at wehi.EDU.AU
Mon Nov 26 03:35:46 CET 2012
On Mon, 26 Nov 2012, Gordon K Smyth wrote:
> Dear Belisa,
>
> Your experiment has 17 different conditions, so you obviously cannot analyse
> it as a 2x2 experiment. (A 2x2 experiment has only 4 conditions in total.)
>
> The simplest way to analyse your experiment is to create a single factor with
> 25 levels, and to analyse your data as in Section 8.3 in the limma User's
> Guide.
That should read "17 levels", one for each condition.
Gordon
> This allows you to test any hypothesis you like, including testing
> for interactions.
>
> If you have lots of contrasts, but you don't tell topTable() which contrast
> you want to test for, then topTable() will test whether *any* of the
> contrasts are different from zero. This is analogous to an F-test where the
> numerator degrees of freedomm are the number of contrasts. The help page for
> topTable() says:
>
> "topTableF ranks genes on the basis of moderated F-statistics for one or more
> coefficients. If topTable is called with coef that has length greater than 1,
> then the specified columns will be extracted from fit and topTableF called on
> the result. topTable with coef=NULL is the same as topTableF, unless the
> fitted model fit has only one column."
>
> You might find it very help to collaborate with a statistical
> bioinformatician at your own institute, if one is available.
>
> Best wishes
> Gordon
>
>
>> Date: Sat, 24 Nov 2012 06:06:22 -0800 (PST)
>> From: "Belisa Santos [guest]" <guest at bioconductor.org>
>> To: bioconductor at r-project.org, belisa.santos.duarte at gmail.com
>> Subject: [BioC] Help interpreting many contrasts in one contrast
>> matrix versus many individual contrast matrices
>>
>>
>> Hello everybody,
>>
>> I am having a hard time interpreting in a meaningful way the output
>> from a contrast matrix with many contrasts versus a smal contrast matrix
>> with few contrasts and how they compare to each other.
>>
>> # Description of my dataset:
>>
>> Control: No treatment and time zero (total 6 replicates)
>> Treatment A: time1, time2, time3 and time4 (3 replicates each, total 12)
>> Treatment AB: time1, time2, time3 and time4 (3 replicates each, total 12)
>> Treatment AC: time1, time2, time3 and time4 (3 replicates each, total 12)
>> Treatment ABC: time1, time2, time3 and time4 (3 replicates each, total 12)
>>
>> Total of 54 microarrays, where A, B and C are different compounds used for
>> the growth media of the cells.
>>
>> - I do not have ONE unique research question. I want to see the effect of
>> time, the effect of treatment and the effect of the interaction
>> time-treatment. Also, I have one very specific question which is: What is
>> the effect of the interaction BC? (Not interested in the effect of time for
>> this one...)
>>
>> # My approach:
>
>> - I made a design matrix using Control as intercept (so first column
>> (control) filled with 1s)
>
>> - Then made 3 BIG contrast matrices: one for the treatment factor (i.e.
>> all combinations of contrasts between same time different treatment ), one
>> for the time factor (i.e. all combinations of same treatment different
>> time) and one for the interaction treatment-time (all combinations
>> treatment-time). (Still have to come up with a clever way to find the
>> effect of the interaction BC...)
>>
>> # My doubts are:
>>
>> 1) Can I describe my experiment as a 2x2 factorial design (2 factors:
>> time and treatment)? (I ask this because I also have that extra control I
>> used as intercept...)
>>
>> 2) Am I correct to interpret that given that I have used the control as
>> intercept in the design matrix, all subsequent contrasts will have the
>> effect of control "subtracted"?
>
>> 2.1) Is this a correct approach for my case? (Is this conceptually
>> correct? Is it done frequently? Is it the most elegant way to do it, or are
>> there "better" alternatives?)
>>
>> 3) Finally I am having problems interpreting the outcome of my contrasts
>> from the matrices with many contrasts. For example for my contrast matrix
>> for the treatment factor (there are 24 individual contrasts), when I ask
>> for a topTable (without specifying any particular coefficient), what is
>> exactly the meaning of that list? Are those the union of all the genes that
>> are differently expressed in all contrasts and then ordered? Or is there
>> any other testing done that makes this DEG list more meaningful than just
>> doing individual contrasts, uniting the sets and ordering them... I feel
>> these cannot be the same... but do not know... and I need help to interpret
>> it correctly.
>>
>> I would really appreciate some help with these doubts. I have read the
>> documentation several times now, but my experimental design is not fully
>> covered by any example... and i would like to be sure that i am analyzing
>> my data correctly.
>>
>> Thank you in advance for your attention and patience. Kind regards,
>>
>> Belisa
>>
>> -- output of sessionInfo():
>>
>>> sessionInfo()
>> R version 2.15.0 (2012-03-30)
>> Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
>>
>> locale:
>> [1] C/en_US.UTF-8/C/C/C/C
>>
>> attached base packages:
>> [1] stats graphics grDevices utils datasets methods base
>>
>> other attached packages:
>> [1] limma_3.14.1 annotate_1.36.0 hgu133plus2cdf_2.11.0
>> hgu133plus2.db_2.8.0
>> [5] org.Hs.eg.db_2.8.0 RSQLite_0.11.2 DBI_0.2-5
>> AnnotationDbi_1.20.2
>> [9] affy_1.36.0 Biobase_2.18.0 BiocGenerics_0.4.0
>>
>> loaded via a namespace (and not attached):
>> [1] BiocInstaller_1.8.3 IRanges_1.16.4 XML_3.95-0.1
>> affyio_1.26.0
>> [5] parallel_2.15.0 preprocessCore_1.20.0 stats4_2.15.0
>> tools_2.15.0
>> [9] xtable_1.7-0 zlibbioc_1.4.0
>>
>> --
>> Sent via the guest posting facility at bioconductor.org.
>
______________________________________________________________________
The information in this email is confidential and intend...{{dropped:4}}
More information about the Bioconductor
mailing list