[Bioc-devel] systemPipeR error - Error in NSBS(i, x, exact = exact, upperBoundIsStrict = !allow.append) :

Arora, Sonali sarora at fredhutch.org
Fri Oct 23 02:39:04 CEST 2015


Hi Thomas,

I get the following error when I try to obtain the feature types using 
the function genFeatures()


 > library(systemPipeR)
 > library(GenomicFeatures)
Loading required package: AnnotationDbi
 > txdb <-  makeTxDbFromUCSC(genome = "hg19", tablename = "refGene")
Download the refGene table ... OK
Download the refLink table ... OK
Extract the 'transcripts' data frame ... OK
Extract the 'splicings' data frame ... OK
Download and preprocess the 'chrominfo' data frame ... OK
Prepare the 'metadata' data frame ... OK
Make the TxDb object ... OK
Warning message:
In .extractCdsLocsFromUCSCTxTable(ucsc_txtable, exon_locs) :
   UCSC data anomaly in 359 transcript(s): the cds cumulative length is
   not a multiple of 3 for transcripts ‘NM_001037501’ ‘NM_001277444’
   ‘NM_001037675’ ‘NM_001271872’ ‘NM_001170637’ ‘NM_001300952’
   ‘NM_015326’ ‘NM_017940’ ‘NM_001271870’ ‘NM_001143962’ ‘NM_001305275’
   ‘NM_001146344’ ‘NM_001300891’ ‘NM_001010890’ ‘NM_001300891’
   ‘NM_001289974’ ‘NM_001291281’ ‘NM_001301371’ ‘NM_016178’
   ‘NM_001134939’ ‘NM_001080427’ ‘NM_001145710’ ‘NM_001291328’
   ‘NM_001271466’ ‘NM_001017915’ ‘NM_005541’ ‘NM_000348’ ‘NM_001145051’
   ‘NM_001135649’ ‘NM_001128929’ ‘NM_001080423’ ‘NM_001144382’
   ‘NM_001291661’ ‘NM_002958’ ‘NM_001005861’ ‘NM_004636’ ‘NM_001005914’
   ‘NM_001290060’ ‘NM_001290061’ ‘NM_001289930’ ‘NM_003715’
   ‘NM_001290049’ ‘NM_001286054’ ‘NM_001286053’ ‘NM_001286052’
   ‘NM_182524’ ‘NM_001075’ ‘NM_00 [... truncated]
 > feat <- genFeatures(txdb, featuretype="all", reduce_ranges=TRUE, 
upstream=1000,
+                    downstream=0, verbose=TRUE)
Error in NSBS(i, x, exact = exact, upperBoundIsStrict = !allow.append) :
   subscript contains NAs


probably because -

Browse[2]> tx
GRanges object with 54439 ranges and 3 metadata columns:
                 seqnames           ranges strand   |      tx_name
                    <Rle>        <IRanges>  <Rle> |  <character>
       [1]           chr1   [11874, 14409]      +   |    NR_046018
       [2]           chr1   [30366, 30503]      +   |    NR_036051
       [3]           chr1   [30366, 30503]      +   |    NR_036266
       [4]           chr1   [30366, 30503]      +   |    NR_036267
       [5]           chr1   [30366, 30503]      +   |    NR_036268
       ...            ...              ...    ... ...          ...
   [54435] chrUn_gl000228 [112605, 114676]      +   | NM_001306068
   [54436] chrUn_gl000228 [ 29339,  32226]      -   | NM_001005217
   [54437] chrUn_gl000228 [ 29339,  32226]      -   | NM_001286820
   [54438] chrUn_gl000241 [ 14739,  36767]      -   |    NR_132315
   [54439] chrUn_gl000241 [ 16025,  36957]      -   |    NR_132320
                   gene_id     tx_type
           <CharacterList> <character>
       [1]       100287102        <NA>
       [2]       100302278        <NA>
       [3]       100422831        <NA>
       [4]       100422834        <NA>
       [5]       100422919        <NA>
       ...             ...         ...
   [54435]       100288687        <NA>
   [54436]          448831        <NA>
   [54437]          448831        <NA>
   [54438]       100289097        <NA>
   [54439]       102723780        <NA>
   -------
   seqinfo: 93 sequences (1 circular) from hg19 genome
Browse[2]>  unique(mcols(tx)$tx_type)
[1] NA
debug: tmp <- tx[mcols(tx)$tx_type == tx_type[i]]
Browse[2]>
Error in NSBS(i, x, exact = exact, upperBoundIsStrict = !allow.append) :
   subscript contains NAs


Here is my sessionInfo

 > sessionInfo()
R Under development (unstable) (2015-10-15 r69519)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 14.04.2 LTS

locale:
  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
  [9] LC_ADDRESS=C               LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages:
[1] parallel  stats4    stats     graphics  grDevices utils datasets
[8] methods   base

other attached packages:
  [1] GenomicFeatures_1.23.3     AnnotationDbi_1.33.0
  [3] systemPipeR_1.5.1          RSQLite_1.0.0
  [5] DBI_0.3.1                  ShortRead_1.25.10
  [7] GenomicAlignments_1.7.1    SummarizedExperiment_1.1.0
  [9] Biobase_2.31.0             BiocParallel_1.5.0
[11] Rsamtools_1.23.0           Biostrings_2.39.0
[13] XVector_0.11.0             GenomicRanges_1.21.32
[15] GenomeInfoDb_1.7.1         IRanges_2.5.3
[17] S4Vectors_0.9.5            BiocGenerics_0.17.0

loaded via a namespace (and not attached):
  [1] Rcpp_0.12.1            lattice_0.20-33        GO.db_3.2.2
  [4] digest_0.6.8           plyr_1.8.3 futile.options_1.0.0
  [7] BatchJobs_1.6          ggplot2_1.0.1          zlibbioc_1.17.0
[10] annotate_1.49.0        Matrix_1.2-2           checkmate_1.6.2
[13] proto_0.3-10           GOstats_2.37.0         splines_3.3.0
[16] stringr_1.0.0          pheatmap_1.0.7         RCurl_1.95-4.7
[19] biomaRt_2.27.0         munsell_0.4.2          sendmailR_1.2-1
[22] rtracklayer_1.31.1     base64enc_0.1-3        BBmisc_1.9
[25] fail_1.3               edgeR_3.13.0           XML_3.98-1.3
[28] AnnotationForge_1.13.0 MASS_7.3-44            bitops_1.0-6
[31] grid_3.3.0             RBGL_1.47.0            xtable_1.7-4
[34] GSEABase_1.33.0        gtable_0.1.2           magrittr_1.5
[37] scales_0.3.0           graph_1.49.1           stringi_1.0-1
[40] hwriter_1.3.2          reshape2_1.4.1         genefilter_1.53.0
[43] limma_3.27.0           latticeExtra_0.6-26 futile.logger_1.4.1
[46] brew_1.0-6             rjson_0.2.15           lambda.r_1.1.7
[49] RColorBrewer_1.1-2     tools_3.3.0            Category_2.37.0
[52] survival_2.38-3        colorspace_1.2-6




-- 
Thanks and Regards,
Sonali



More information about the Bioc-devel mailing list