[BioC] Understanding the columns in the limma results output
James W. MacDonald
jmacdon at uw.edu
Tue Aug 28 17:41:00 CEST 2012
Hi Jorge,
On 8/28/2012 11:20 AM, Jorge Miró wrote:
> Hi,
>
> I have run the commands below to get an analysis of differential
> expressions in my Affymetrix arrays
>
> #Prepare the design and contrast matrices for my comparisons of the three
> groups in a loop-manner.
>> design<- model.matrix(~0+factor(c(1,1,1,2,2,2,3,3,3)))
>> colnames(design)<- c('GroupA', 'GroupB', 'GroupC')
>> contrast.matrix<- makeContrasts(GroupB-GroupA, GroupC-GroupA,
> GroupC-GroupB, levels=design)
>
> #Check design and contrast matrices
>> design
> GroupA GroupB GroupC
> 1 1 0 0
> 2 1 0 0
> 3 1 0 0
> 4 0 1 0
> 5 0 1 0
> 6 0 1 0
> 7 0 0 1
> 8 0 0 1
> 9 0 0 1
> attr(,"assign")
> [1] 1 1 1
> attr(,"contrasts")
> attr(,"contrasts")$`factor(c(1, 1, 1, 2, 2, 2, 3, 3, 3))`
> [1] "contr.treatment"
>
>> contrast.matrix
> Contrasts
> Levels GroupB - GroupA GroupC - GroupA GroupC - GroupB
> GroupA -1 -1 0
> GroupB 1 0 -1
> GroupC 0 1 1
>
> #Fitting the eset to to the design and contrast
>> fit<- lmFit(exprs, design)
>> fit2<- contrasts.fit(fit, contrast.matrix)
> #Computing the statistics
>> fit2<- eBayes(fit2)
>
> Then I check the results with topTable and get the following columns in the
> output
>> topTable(fit2)
> GroupB...GroupA GroupC...GroupA GroupC...GroupB AveExpr F
> P.Value adj.P.Val
> 25031 2.3602203 2.4273830 0.06716267 5.021412 29.06509
> 7.844834e-05 0.9587773
> 12902 -0.4572897 -0.5680943 -0.11080467 7.516681 25.41608
> 1.365021e-04 0.9587773
> 7158 -0.4478660 -0.4296077 0.01825833 7.057833 23.48871
> 1.880100e-04 0.9587773
> 18358 -0.1002647 0.3304903 0.43075500 7.352807 22.78417
> 2.125096e-04 0.9587773
> 28768 -0.7695883 -1.3837750 -0.61418667 3.983044 22.47514
> 2.244612e-04 0.9587773
> 28820 -0.1708800 -0.9939680 -0.82308800 5.470826 18.25071
> 5.081473e-04 0.9587773
> 15238 -0.4850297 -0.4658157 0.01921400 7.071662 17.15191
> 6.440979e-04 0.9587773
> 24681 -0.3759717 -0.3486450 0.02732667 9.281578 16.47813
> 7.493077e-04 0.9587773
> 19246 -0.8675393 -0.5082140 0.35932533 8.123538 16.27776
> 7.845150e-04 0.9587773
> 28808 0.2601277 0.6909140 0.43078633 4.814602 16.21283
> 7.963487e-04 0.9587773
>
> I want to export my results and write
>
>> results<- decideTests(fit2)
>> write.fit(fit2, results, "limma_results.txt", adjust="BH")
> Now don't get the same columns as when using topTable which is quite
> confusing. Why don't I get the FC for the comparisons between the different
> groups as if I run topTable with the coef parameter ( "topTable(fit2,
> coef=1)" )? The columns I get are the following
The simple answer is that they are two different functions with
different goals. But note that you do get the same information.
>
> A
>
> Coef.GroupB - GroupA
> Coef.GroupC - GroupA
> Coef.GroupC - GroupB
>
> t.GroupB - GroupA
> t.GroupC - GroupA
> t.GroupC - GroupB
>
> p.value.GroupB - GroupA
> p.value.GroupC - GroupA
> p.value.GroupC - GroupB
>
> p.value.adj.GroupB - GroupA
> p.value.adj.GroupC - GroupA
> p.value.adj.GroupC - GroupB
>
> F
> F.p.value
>
> Res.GroupB - GroupA
> Res.GroupC - GroupA
> Res.GroupC - GroupB
>
>
> Could some body please try to explain what do the columns A, Coef, F,
> F.p.value and Res mean?
A - are your log fold change values
Coef - are your coefficients (you set up a cell means model, so these
are the sample means)
F - is an F-statistic, which tests the null hypothesis that none of the
sample means are different
F.p.value - is the p-value for the F-statistic
Res - is the results matrix you passed into write.fit(), showing which
contrast(s) were significant
Best,
Jim
>
>
>
> #Session info
>> sessionInfo()
> R version 2.15.0 (2012-03-30)
> Platform: i386-pc-mingw32/i386 (32-bit)
>
> locale:
> [1] LC_COLLATE=Swedish_Sweden.1252 LC_CTYPE=Swedish_Sweden.1252
> LC_MONETARY=Swedish_Sweden.1252 LC_NUMERIC=C
> LC_TIME=Swedish_Sweden.1252
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] limma_3.12.1 Biobase_2.16.0 BiocGenerics_0.2.0
>
> loaded via a namespace (and not attached):
> [1] affylmGUI_1.30.0 IRanges_1.14.4 oneChannelGUI_1.22.10
> stats4_2.15.0 tcltk_2.15.0
> Best regards
> JMA
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list