[BioC] Is there a package that maps short sequence to exon?
Martin Morgan
mtmorgan at fhcrc.org
Fri Jun 15 22:06:46 CEST 2012
On 06/15/2012 11:08 AM, ying chen wrote:
>
>
>
>
> Hi guys, I just wonder if there is any Bioconductor package that can
> take a short nucleotide sequence (25 mer) or its genomic coordinate
> (chr& pos) as input and return exon number it maps to? Thanks a lot
Take a look at GenomicRanges / GenomicFeatures, you might read your
alignments as readGappedAlignments, or just create a GRanges() object,
library(GenomicRanges)
reads = GRanges(c("chr1", "chr7"),
IRanges(start=c(12614, 195554), width=1))
and then use a package like TxDb.Hspaiens
library(TxDb.Hsapiens.UCSC.hg19.knownGene)
ex = exons(TxDb.Hsapiens.UCSC.hg19.knownGene)
and findOverlaps
hits = findOverlaps(reads, ex)
to discover that your 'query' (reads) overlaps the exons
> queryHits(hits)
[1] 1 1 2
> values(ex)$exon_id[subjectHits(hits)]
[1] 5 2 98786
Martin
> for the help! Ying Chen [[alternative HTML version deleted]]
>
> _______________________________________________ Bioconductor mailing
> list Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor Search the
> archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
--
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109
Location: Arnold Building M1 B861
Phone: (206) 667-2793
More information about the Bioconductor
mailing list