[BioC] building a refseq-based transcriptDb: warnings of interest?
    Vincent Carey 
    stvjc at channing.harvard.edu
       
    Fri Jul 23 11:50:49 CEST 2010
    
    
  
> hg18r.txdb = makeTranscriptDbFromUCSC(tablename="refGene")
Download the refGene table ... OK
Download the refLink table ... OK
Extract the 'transcripts' data frame ... OK
Extract the 'splicings' data frame ... OK
Download and preprocess the 'chrominfo' data frame ... OK
Prepare the 'metadata' data frame ... OK
Make the TranscriptDb object ... OK
There were 50 or more warnings (use warnings() to see the first 50)
> warnings()
Warning messages:
1: In .extractUCSCCdsStartEnd(cdsStart[i], cdsEnd[i],
exon_locs$start[[i]],  ... :
  UCSC data anomaly in transcript NM_017940: the cds cumulative length
is not a multiple of 3
2: In .extractUCSCCdsStartEnd(cdsStart[i], cdsEnd[i],
exon_locs$start[[i]],  ... :
  UCSC data anomaly in transcript NM_001037675: the cds cumulative
length is not a multiple of 3
3: In .extractUCSCCdsStartEnd(cdsStart[i], cdsEnd[i],
exon_locs$start[[i]],  ... :
  UCSC data anomaly in transcript NM_001039703: the cds cumulative
length is not a multiple of 3
4: In .extractUCSCCdsStartEnd(cdsStart[i], cdsEnd[i],
exon_locs$start[[i]],  ... :
and so on.  Does this need to be reported to UCSC?
>  sessionInfo()
R version 2.12.0 Under development (unstable) (2010-06-30 r52417)
Platform: x86_64-apple-darwin10.3.0/x86_64 (64-bit)
locale:
[1] C
attached base packages:
[1] stats     graphics  grDevices datasets  tools     utils     methods
[8] base
other attached packages:
[1] GenomicFeatures_1.1.6 GenomicRanges_1.1.15  IRanges_1.7.13
[4] weaver_1.15.0         codetools_0.2-2       digest_0.4.2
loaded via a namespace (and not attached):
[1] BSgenome_1.17.5    Biobase_2.9.0      Biostrings_2.17.26 DBI_0.2-5
[5] RCurl_1.4-2        RSQLite_0.9-1      XML_3.1-0          biomaRt_2.5.1
[9] rtracklayer_1.9.3
    
    
More information about the Bioconductor
mailing list