[BioC] In the TranscriptDb object, where do the Entrez Gene IDs come from ?

Marc Carlson mcarlson at fhcrc.org
Mon Jan 31 18:05:03 CET 2011


Hi Arnaud,

I am glad that you like this feature.   The ultimate source of this
information is by linking the known gene table to the knownToLocusLink
table at UCSC.

http://genome.ucsc.edu/cgi-bin/hgTables?hgsid=184855093&hgta_doSchemaDb=hg19&hgta_doSchemaTable=knownToLocusLink


  Marc



On 01/26/2011 01:31 PM, Arnaud Amzallag, Ph.D wrote:
> Hello, 
>
> I have been trying on the UCSC table browser to find a table that links kgIDs with gene IDs, but I did not find that. With the package GenomicFeatures, when constructing a TranscriptDB object from UCSC, one can group the transcripts by gene, the gene names being the Entrez gene IDs, which is quite neat.
>
>   
>> txdb <- makeTranscriptDbFromUCSC(genome = "hg18", tablename = "knownGene")
>> txbygene <- transcriptsBy(txdb, by="gene")
>> head(names(txbygene))
>>     
> [1] "1"         "10"        "100"       "1000"      "10000"     "100009676"
>
> I would like to know how the the kgID have been associated with Entrez gene IDs.
>
> Thanks !
>
> Arnaud
>
>
>
> The information in this e-mail is intended only for the ...{{dropped:11}}
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>



More information about the Bioconductor mailing list