[BioC] biomaRt : multiple entrezgene ids to one affy id?

Sean Davis sdavis2 at mail.nih.gov
Mon Apr 2 17:53:36 CEST 2007


On Monday 02 April 2007 11:19, Weiwei Shi wrote:
> Dear all:
>
> Here is some piece of output from my query:
> > mart=useMart(biomart="ensembl", dataset="hsapiens_gene_ensembl")
>
> Checking attributes and filters ... ok
>
> > getBM(attributes=c("affy_hg_u95a", "entrezgene","ensembl_transcript_id"),
> > filters="affy_hg_u95a", values="31330_at", mart=mart)
>
>   affy_hg_u95a entrezgene ensembl_transcript_id
> 1     31330_at       6223       ENST00000221975
> 2     31330_at     728953       ENST00000221975
> 3     31330_at     731572       ENST00000221975
>
> I do not understand why there are multiple entrezgene ids
> corresponding to one affy id. Did I miss something here?

Unfortunately, this is what happens when you start mapping between identifiers 
defined by different authorities.  If you look at the entry for ensembl 
transcript, you will see that it maps to three different EntrezGene IDs.  
Looking a bit further, you will notice that only one of them (6223) is based 
on a mapping between a non-predicted sequence.  The other two are predicted 
sequences.  

Here is the entry in ensembl for reference:

http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000105372

Sean



More information about the Bioconductor mailing list