[BioC] Weighted counts in countGenomicOverlaps
Valerie Obenchain
vobencha at fhcrc.org
Fri Apr 29 01:56:08 CEST 2011
Mete,
The bug is now fixed in the devel trunk (version 1.5.5) and the release
branch (version 1.4.2). It will be a day before the new package versions
propagate through the build system and are available through biocLite.
If you want to retrieve them directly they are available via svn at
release :
https://hedgehog.fhcrc.org/bioconductor/branches/RELEASE_2_8/madman/Rpacks/GenomicRanges
devel :
https://hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/GenomicRanges
I've included an additional example on the man page for
countGenomicOverlaps illustrating the handling of split reads. Let me
know if you run into other problems.
Take care,
Valerie
On 04/26/2011 03:56 PM, Valerie Obenchain wrote:
> Hi Mete,
>
> Yes, you are using the function correctly and you have found a bug.
> I'll let you know as soon as it's fixed.
>
> Thanks,
> Valerie
>
>
> On 04/25/2011 04:38 PM, Mete Civelek wrote:
>> Hi All,
>>
>> I want countGenomicOverlaps to output a weighted hit count such that
>> when a
>> read maps to, for example four loci, a feature at one of those loci
>> would
>> get 1/4th of a count from that read.
>> At the moment, countGenomicOverlaps doesn't behave the way I expect
>> it to.
>>
>> Consider this example:
>>
>> subj = GRangesList(feature1=GRanges(seq='1', IRanges(10,30),
>> strand='+'))
>> qry = GRangesList(read1=GRanges(seq='1',
>> IRanges(c(10,60,100),c(20,70,110)),
>> strand='+'))
>> countGenomicOverlaps(qry, subj, resolution='divide')
>>
>> I would have expected the hit count to be 1/3 but instead it reports
>> it as
>> 1/2. Am I using this function correctly?
>>
>> My sessioninfo is:
>>
>>
>> R version 2.12.2 (2011-02-25)
>> Platform: x86_64-unknown-linux-gnu (64-bit)
>>
>> locale:
>> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
>> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
>> [5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
>> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
>> [9] LC_ADDRESS=C LC_TELEPHONE=C
>> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>>
>> attached base packages:
>> [1] stats graphics grDevices utils datasets methods base
>>
>> other attached packages:
>> [1] GenomicRanges_1.4.0 IRanges_1.10.0
>>
>>
>>
>> IMPORTANT WARNING: This email (and any attachments) is ...{{dropped:9}}
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives:
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list