[BioC] Small RNA seq data analysis using DESeq
Simon Anders
anders at embl.de
Thu Jun 20 19:11:06 CEST 2013
Hi Vedran
> I have a question regarding the analysis of small RNAseq data using
DESeq.
> While counting the reads per loci I have weighted the reads by the
> reciprocal of the places to which the read maps.
> I was wondering whether it is still proper to use the negative binomial
> test implemented in DESeq (after rounding the expression estimates) to
> determine which loci are differentially expressed?
No, for two reason:
1. DESeq expects raw counts. Your weighting violates the assumptions
behind the nehative-binomial model.
2. Imagine two of your loci are quite similar, such that most reads that
map to locus A also map to locus B. Further, imagine that you compare
treated samples to control ones, and locus A gets upregulated in
response to the treatment while locus B is unaffected. With your method
of summerizing the data, all the additional reads that the upregulated
locus A produces in the treatment samples will also be counted for locus
B, and hence, you will wrongly conclude that both loci react to the
treatment.
Note that the second issue is a problem not only to NB-based method, but
rather shows that you approach is in general not suitable for
differential expression analyses.
Simon
More information about the Bioconductor
mailing list