[BioC] unable to find known entrezgene with biomaRt

Dick Beyer dbeyer at u.washington.edu
Sat Jan 19 17:08:30 CET 2008


Hi Jim,

Thanks for explaining this to me.  I had assumed that if the gene was in ensembl, then I could get other bits of info such as Entrez Gene ID and such.

Is there some bioconductor way, similar to biomaRt, to access this Entrez Gene ID?  What I am really using the getBM call for is just to get a gene symbol and a gene description given the Entrez Gene ID.

Thanks very much,
Dick  
*******************************************************************************
Richard P. Beyer, Ph.D.	University of Washington
Tel.:(206) 616 7378	Env. & Occ. Health Sci. , Box 354695
Fax: (206) 685 4696	4225 Roosevelt Way NE, # 100
 			Seattle, WA 98105-6099
http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
http://staff.washington.edu/~dbeyer
*******************************************************************************

On Sat, 19 Jan 2008, James W. MacDonald wrote:

> Hi Dick,
>
> I'm not sure I understand your question. When I go to the webpage you 
> reference, there is AFAICT no mention of this gene being the same as Entrez 
> Gene 3514 (other than having the same symbol). Nor does Entrez Gene mention 
> that it is the same as Ensembl Gene ENSG00000211592.
>
> A quick look at the location of the gene would imply that it probably is the 
> same, and not two genes that have the same symbol (which is not unique).
>
> Since both the web interface and the programmatic interface agree, this isn't a 
> matter of inconsistencies between the interfaces, so perhaps the question is 
> why do Entrez Gene and Ensembl not reference each other?
>
> If so, this I think is simply due to the fact that you have two different 
> groups that are doing the annotation, and they are not always perfect at 
> referencing each other.
>
> Best,
>
> Jim
>
>
>
> Dick Beyer wrote:
>> Hello,
>> 
>> I am unable to find some Entrez Gene IDs in the ensembl homo sapiens 
>> database via biomaRt, even though I can access them via the ensembl web.
>> 
>> library(biomaRt)
>> mart <- useMart( "ensembl", dataset="hsapiens_gene_ensembl")
>> 
>> getBM(attributes=c("entrezgene","hgnc_symbol","ensembl_gene_id"),filters="entrezgene",values=3845, 
>> mart=mart)
>>    entrezgene hgnc_symbol ensembl_gene_id
>> 1       3845        KRAS ENSG00000133703
>> 
>> getBM(attributes=c("entrezgene","hgnc_symbol","ensembl_gene_id"),filters="entrezgene",values=3514, 
>> mart=mart)
>> NULL
>> 
>> The ensembl web interface:
>> 
>> http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000211592
>> 
>> shows Entrez Gene ID 3514 corresponds to ensembl_gene_id ENSG00000211592, 
>> IGKC.
>> 
>> I'm curious why my biomaRt session will return good results for some valid 
>> Entrez Gene IDs but not for others.  I'm not sure what to try next.  I'd 
>> very much appreciate any help.
>> 
>> sessionInfo()
>> R version 2.6.1 (2007-11-26)
>> x86_64-redhat-linux-gnu
>> 
>> locale:
>> LC_CTYPE=en_US.UTF-8;LC_NUMERIC=C;LC_TIME=en_US.UTF-8;LC_COLLATE=en_US.UTF-8;LC_MONETARY=en_US.UTF-8;LC_MESSAGES=en_US.UTF-8;LC_PAPER=en_US.UTF-8;LC_NAME=C;LC_ADDRESS=C;LC_TELEPHONE=C;LC_MEASUREMENT=en_US.UTF-8;LC_IDENTIFICATION=C
>> 
>> attached base packages:
>> [1] tools     stats     graphics  grDevices utils     datasets  methods
>> [8] base
>> 
>> other attached packages:
>>   [1] topGO_1.4.0         SparseM_0.75        AnnotationDbi_1.0.6
>>   [4] RSQLite_0.6-4       DBI_0.2-4           GO_2.0.1
>>   [7] Biobase_1.16.2      graph_1.16.1        biomaRt_1.12.2
>> [10] RCurl_0.8-3
>> 
>> loaded via a namespace (and not attached):
>> [1] cluster_1.11.9  rcompgen_0.1-17 XML_1.93-2
>> 
>> Thanks much,
>> Dick
>> *******************************************************************************
>> Richard P. Beyer, Ph.D.	University of Washington
>> Tel.:(206) 616 7378	Env. & Occ. Health Sci. , Box 354695
>> Fax: (206) 685 4696	4225 Roosevelt Way NE, # 100
>>  			Seattle, WA 98105-6099
>> http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
>> http://staff.washington.edu/~dbeyer
>> 
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: 
>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>



More information about the Bioconductor mailing list