[BioC] edgeR: Value of variation of biological variation (BCV)

Gordon K Smyth smyth at wehi.EDU.AU
Tue Aug 14 01:50:51 CEST 2012


Dear Lucia,

I don't have yet any experience with working with in-house generated 
RNA-seq data from brain tissue, so I can't say, but I think I would be 
looking for what sources of variation can be controlled in a case such as 
yours.  When we have had larger BCV from mouse experiments, we've been 
able to track the cause down to sex differences or batch effects or RNA 
contamination.  I have analysed mouse brain tissue using microarrays, and 
found that it was important to control for exactly what aspect of the 
brain was being sampled.  See Achtman et al (Science Translational 
Medicine, 23 May 2012) for such an analysis.

I have noticed that many of the data sets published as examples in 
statistical papers on differential expression for RNA-seq show very much 
larger BCV than I would be happy to have from our in-house controlled 
mouse experiments.  Whether that reflects innate variability or something 
else I don't know.

Best wishes
Gordon

---------------------------------------------
Professor Gordon K Smyth,
Bioinformatics Division,
Walter and Eliza Hall Institute of Medical Research,
1G Royal Parade, Parkville, Vic 3052, Australia.
http://www.statsci.org/smyth

On Mon, 13 Aug 2012, Lucia Peixoto wrote:

> Hi,

> I had a related question regarding BCV, it has been my recent experience
> that working in vivo with heterogeneous tissues such as brain yields a much
> higher BCV than 10%, usually close to 30%
> even though I am working with genetically identical organisms (mice, all
> males as well).
> Is that your observation as well, or does this mean that variation is being
> introduced at some of the sample processing steps?
> I do not have technical replicates, although I have n=5 biological
> replicates for each of my conditions
> Interestingly, filtering out low count reads does not seem to lower the
> average BCV much
>
> thanks very much for your answer
>
> Lucia
>
>
> On Fri, Aug 10, 2012 at 9:40 PM, Gordon K Smyth <smyth at wehi.edu.au> wrote:
>
>> See fourth paragraph of the Discussion of the edgeR glm paper:
>>
>> http://nar.oxfordjournals.org/**content/40/10/4288.long<http://nar.oxfordjournals.org/content/40/10/4288.long>
>>
>> Gordon
>>
>>  Date: Fri, 10 Aug 2012 11:45:31 +0300
>>> From: KJ Lim <jinkeanlim at gmail.com>
>>> To: Bioconductor mailing list <bioconductor at r-project.org>
>>> Subject: [BioC] edgeR: Value of variation of biological variation (BCV)
>>>
>>> Dear edgeR users and R community,
>>>
>>> I'm analysing a set of RNA-Seq data which has 2 different genotypes
>>> (treeHS, treeLS) and time of treatment (0H, 3H, 24H, 96H) with edgeR.
>>>
>>> After carried out estimating the common dispersion, the value of BCV
>>> for my RNA-Seq data was 0.428.
>>> > hl <- estimateGLMCommonDisp(hl, hl.design, verbose=TRUE)
>>>  Disp = 0.18319 , BCV = 0.428
>>>
>>> May I ask, it is common to see the value of variation of biological
>>> variation (BCV) as such? What this value can tells about the RNA-Seq
>>> data? I have these thoughts wondering me for a while. Forgive me if I
>>> have asked a stupid question.
>>>
>>> Thank you very much for your time.
>>>
>>> Have a nice weekend.
>>>
>>> Best regards,
>>> KJ Lim
>>>
>>>
>> ______________________________**______________________________**__________
>> The information in this email is confidential and intend...{{dropped:4}}
>>
>> ______________________________**_________________
>> Bioconductor mailing list
>> Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/**listinfo/bioconductor<https://stat.ethz.ch/mailman/listinfo/bioconductor>
>> Search the archives: http://news.gmane.org/gmane.**
>> science.biology.informatics.**conductor<http://news.gmane.org/gmane.science.biology.informatics.conductor>
>>
>

______________________________________________________________________
The information in this email is confidential and intend...{{dropped:4}}



More information about the Bioconductor mailing list