[BioC] DESeq Variance Stabilizing Transformation

Hickman, R.J. (Richard) R.J.Hickman at uu.nl
Mon May 13 11:06:26 CEST 2013


Dear All,

I am looking for some feedback regarding the use of the variance-stabilization (VST) methods found in the DESeq2 package. For me, the purpose for applying this transformation is to be able to generate moderated fold changes for clustering of genes (not samples as described in the DESeq vignette).

My data consists of a time series, where for each time point there is a "treated" sample and a "control" sample. Each sample (timepoint) consists of 4 biological replicates.

I performed the VST on the entire set of data and plot the per-gene standard deviation against the rank of the
mean* (see attached figure timeseriesVST.png), for the shifted logarithm log2 (n + 1) (left) and the variance stabilizing transformation (right), it does not appear to have a pronounced effect.

However, if i set up a count dataset that consists of the samples corresponding to one timepoint only (see attached figure singleTimepointVST.png), and perform the VST and plot the standard deviation against rank of the mean, the transformed values have a much better stabilized standard deviation.

So my questions are: Is there anyway to obtain better variance stabilized data when considering the entire timeseries? Should I just perform the VST on a per timepoint basis; after all I will only be computing fold changes between treatment and control samples at the same timepoint.


Best wishes,

Richard

*The procedure was performed as per the DESeq2 manual:

dds <- estimateSizeFactors(dds)
dds <- estimateDispersions(dds)
vsd <- varianceStabilizingTransformation(dds)
par(mfrow=c(1,2))
plot(rank(rowMeans(counts(dds))), genefilter::rowVars(log2(counts(dds)+1)), main="log2(x+1) transform")
plot(rank(rowMeans(assay(vsd))), genefilter::rowVars(assay(vsd)), main="VST")






-------------- next part --------------
A non-text attachment was scrubbed...
Name: singleTimepointVST.png
Type: image/png
Size: 300794 bytes
Desc: singleTimepointVST.png
URL: <https://stat.ethz.ch/pipermail/bioconductor/attachments/20130513/eab08c05/attachment.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: timeseriesVST.png
Type: image/png
Size: 367657 bytes
Desc: timeseriesVST.png
URL: <https://stat.ethz.ch/pipermail/bioconductor/attachments/20130513/eab08c05/attachment-0001.png>


More information about the Bioconductor mailing list