[BioC] Empty Granges
Valerie Obenchain
vobencha at fhcrc.org
Mon Jan 7 22:25:59 CET 2013
Hi Marco,
On 01/07/2013 10:43 AM, Marco Lo Iacono wrote:
> Hello BioC,
> I have a problem using refLocsToLocalLocs, predictCoding in some GRanges...
>
> rm(list=ls()) #cleaning up
> library(VariantAnnotation)
> library(BSgenome.Hsapiens.UCSC.hg19)
> library(TxDb.Hsapiens.UCSC.hg19.knownGene)
> txdb <- TxDb.Hsapiens.UCSC.hg19.knownGene
>
> roi1 <- GRanges("chr11", IRanges(108236190, width=1)) #There is
> roi2 <- GRanges("chr11", IRanges(108093559, width=1)) #There isn't!!!
> refLocsToLocalLocs(roi1, txdb)
> refLocsToLocalLocs(roi2, txdb)
>
> #but the position chr11:108093559 is presents in the ATM gene
> select(txdb, keys = 472, cols = c("TXCHROM", "TXSTRAND", "TXSTART",
> "TXEND",
> "TXID", "TXNAME"), keytype = "GENEID")
>
> #other positions with the same "empty results"...
> chr4:1807922, chr14:105246407, chr11:108123531
predictCoding() returns results for ranges that fall in coding regions.
To see if your range is in a coding region you can use locateVariants().
> loc <- locateVariants(roi2, txdb, region=AllVariants())
>> loc
> GRanges with 8 ranges and 7 metadata columns:
> seqnames ranges strand | LOCATION QUERYID TXID
> <Rle> <IRanges> <Rle> | <factor> <integer> <integer>
> [1] chr11 [108093559, 108093559] * | fiveUTR 1 41703
> [2] chr11 [108093559, 108093559] * | threeUTR 1 41703
> [3] chr11 [108093559, 108093559] * | promoter 1 41703
> [4] chr11 [108093559, 108093559] * | fiveUTR 1 41704
> [5] chr11 [108093559, 108093559] * | threeUTR 1 41704
> [6] chr11 [108093559, 108093559] * | promoter 1 41704
> [7] chr11 [108093559, 108093559] * | promoter 1 41705
> [8] chr11 [108093559, 108093559] * | promoter 1 43960
> CDSID GENEID PRECEDEID FOLLOWID
> <integer> <character> <character> <character>
> [1] <NA> 472 <NA> <NA>
> [2] <NA> 472 <NA> <NA>
> [3] <NA> 472 <NA> <NA>
> [4] <NA> 472 <NA> <NA>
> [5] <NA> 472 <NA> <NA>
> [6] <NA> 472 <NA> <NA>
> [7] <NA> 472 <NA> <NA>
> [8] <NA> 4863 <NA> <NA>
roi2 overlaps 4 different transcripts (TXID) in 2 genes (GENEID). It
does not fall in a coding region but instead in UTR and promoter regions
(LOCATION). So, it makes sense that you would not see predictCoding()
results for this range.
Valerie
>
>
>
> Thank you in advance for help!
>
> Ciao, Marco
>
> P.s. My sessionInfo()
>
> R version 2.15.2 (2012-10-26)
> Platform: i386-w64-mingw32/i386 (32-bit)
>
> locale:
> [1] LC_COLLATE=Italian_Italy.1252 LC_CTYPE=Italian_Italy.1252
> LC_MONETARY=Italian_Italy.1252 LC_NUMERIC=C
> [5] LC_TIME=Italian_Italy.1252
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] VariantAnnotation_1.4.6 Rsamtools_1.10.2
> TxDb.Hsapiens.UCSC.hg19.knownGene_2.8.0
> [4] GenomicFeatures_1.10.1 AnnotationDbi_1.20.3
> Biobase_2.18.0
> [7] BSgenome.Hsapiens.UCSC.hg19_1.3.19 BSgenome_1.26.1
> Biostrings_2.26.2
> [10] GenomicRanges_1.10.5 IRanges_1.16.4
> BiocGenerics_0.4.0
>
> loaded via a namespace (and not attached):
> [1] biomaRt_2.14.0 bitops_1.0-5 DBI_0.2-5
> parallel_2.15.2 RCurl_1.95-3 RSQLite_0.11.2 rtracklayer_1.18.2
> stats4_2.15.2
> [9] tools_2.15.2 XML_3.95-0.1 zlibbioc_1.4.0
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list