[BioC] questions about Overlap encodings
Henry Paik
hp65 at med.harvard.edu
Tue May 8 18:18:19 CEST 2012
Hi List and Hervé,
First of all, thank you very much for all your hard work on Bioconductor.
I have some questions about OverlapEncodings
1. OverlapEncodings object looks like below.
> ovenc1
OverlapEncodings of length 2
Loffset Roffset encoding
[1] 4 0 2:jm:af:
[2] 4 0 2:jm:af:
Under encoding, 2 is the number of exons in this junction. What do jm
and af (or i, am, gm,...) mean?
2. How could I know which exons are involved to a junction? or how could
I know the exon ranks from OverlapEncodings?
extractSpannedExonRanks function is not implemented?
When I did
?extractSpannedExonRanks
I got
No documentation for ‘extractSpannedExonRanks’
3. How could I know the number of unique reads on unique junctions?
I can filter to get the unique match for each read before bioC analysis.
But to quantify the reads on the junctions, I think I need the count of
the unique read_id on unique junction?
I am learning all these from "Overlap encodings" vignette. Are there any
other documentation to learn more about this?
Thanks much!
- Henry
> sessionInfo()
R version 2.15.0 (2012-03-30)
Platform: x86_64-pc-linux-gnu (64-bit)
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] Rsamtools_1.8.4 Biostrings_2.24.1 GenomicFeatures_1.8.1
[4] AnnotationDbi_1.18.0 Biobase_2.16.0 GenomicRanges_1.8.5
[7] IRanges_1.14.2 BiocGenerics_0.2.0
loaded via a namespace (and not attached):
[1] biomaRt_2.12.0 bitops_1.0-4.1 BSgenome_1.24.0 compiler_2.15.0
[5] DBI_0.2-5 RCurl_1.91-1 RSQLite_0.11.1 rtracklayer_1.16.1
[9] stats4_2.15.0 tools_2.15.0 XML_3.9-4 zlibbioc_1.2.0
More information about the Bioconductor
mailing list