[BioC] Interaction term logFC
James W. MacDonald
jmacdon at uw.edu
Wed Jul 16 19:46:32 CEST 2014
Hi Neetha,
The interaction term is (algebraically) something like
(colony1_treatment_1 - colony_1_treatment_2) - (colony_2_treatment_1 -
colony_2_treatment_2)
In other words, you are trying to see if the different treatments affect
the two colonies differently. And because of the way it is constructed,
the interaction term fold change isn't by itself that easily interpreted.
As an example, if I just substitute fake numbers into the above formula,
all of these result in a logFC of 2:
(3 - 3) - (1 - 3)
(3 - 1) - (3 - 3)
(3 - 2) - (2 - 3)
There are many other ways a logFC of 2 can arise, so you really want to
look at a plot of the logCPM values to see the underlying pattern. This
is where the ReportingTools package comes in. You can easily make an
HTML table that has little plots in each row that show the
directionality of expression for all four groups. See the vignette for
more information:
http://bioconductor.org/packages/release/bioc/vignettes/ReportingTools/inst/doc/rnaseqAnalysis.pdf
Best,
Jim
On 7/16/2014 1:10 PM, Neetha [guest] wrote:
> Hi,
>
>
> I am using DESeq2 for my RNAseq data analysis. My design has two strains and two conditions. I did my analysis with DESeq2 v 1.4.5 with the design
> dds1 <- DESeqDataSetFromMatrix(countData=counts, colData=design, design=~ Colony + Treatment + Colony:Treatment)
> dds1 <- DESeq(dds1, test="LRT", reduced=Colony + Treatment)
>> resultsNames (dds1)
> [1] "Intercept" "Colony_1_vs_2" "Treatment_1_vs_2" "Colony1.Treatment1"
>
> "Colony1.Treatment1" is the interaction term. Am I right? I don't understand what the log2fold change represents. I read the vignette (last updated May 2014) as well several discussions online, but I am not sure I understand the concept. For those genes with FDR corrected p values below 0.05, when go back and check the VST counts/raw reads I see trends. Could you please help me understand what the log2foldchange means?
> Thanks in advance.
> Neetha
>
>
> -- output of sessionInfo():
>
>> sessionInfo()
> R version 3.1.1 (2014-07-10)
> Platform: i386-w64-mingw32/i386 (32-bit)
>
> locale:
> [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252
> [3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C
> [5] LC_TIME=English_United States.1252
>
> attached base packages:
> [1] parallel stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] DESeq2_1.4.5 RcppArmadillo_0.4.320.0 Rcpp_0.11.2
> [4] GenomicRanges_1.16.3 GenomeInfoDb_1.0.2 IRanges_1.22.9
> [7] BiocGenerics_0.10.0 BiocInstaller_1.14.2
>
> loaded via a namespace (and not attached):
> [1] annotate_1.42.0 AnnotationDbi_1.26.0 Biobase_2.24.0 DBI_0.2-7
> [5] genefilter_1.46.1 geneplotter_1.42.0 grid_3.1.1 lattice_0.20-29
> [9] locfit_1.5-9.1 RColorBrewer_1.0-5 RSQLite_0.11.4 splines_3.1.1
> [13] stats4_3.1.1 survival_2.37-7 tools_3.1.1 XML_3.98-1.1
> [17] xtable_1.7-3 XVector_0.4.0
>
>
> --
> Sent via the guest posting facility at bioconductor.org.
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list