[BioC] using biomaRt to find all genes with kinase activity (and perhaps a Gene Ontology question too)

James W. MacDonald jmacdon at med.umich.edu
Sat Dec 13 22:35:28 CET 2008


Hi Andrew,

I don't know if you can get all the children from biomaRt, but you can 
get them from the org.Hs.eg.db package:

 > library(org.Hs.eg.db)
Loading required package: AnnotationDbi
Loading required package: Biobase
Loading required package: tools

Welcome to Bioconductor

   Vignettes contain introductory material. To view, type
   'openVignette()'. To cite Bioconductor, see
   'citation("Biobase")' and for packages 'citation(pkgname)'.

Loading required package: DBI
 > all.egs <- mget("GO:0016301", org.Hs.egGO2ALLEGS)
 > all.egs <- unique(all.egs[[1]])
 > all.dat <-  data.frame("Entrez Gene" = all.egs, "Symbol" = 
unlist(mget(all.egs, org.Hs.egSYMBOL, ifnotfound=NA)), "UniGene" = 
sapply(mget(all.egs, org.Hs.egUNIGENE, ifnotfound=NA), paste, collapse=","))
 > head(all.dat)
    Entrez.Gene Symbol   UniGene
25          25   ABL1 Hs.431048
27          27   ABL2 Hs.159472
90          90  ACVR1 Hs.470316
91          91 ACVR1B Hs.438918
92          92 ACVR2A Hs.470174
93          93 ACVR2B Hs.174273
 > dim(all.dat)
[1] 796   3


Best,

Jim




Andrew Yee wrote:
> Apologies, if this is a naive question, but I've been trying to use biomaRt
> to retrieve all genes with kinase activity.
> 
> I use the GO term GO:0016301 as follows:
> 
>  ensembl = useMart("ensembl", dataset = "hsapiens_gene_ensembl")
> results <- getBM(c("entrezgene", "hgnc_symbol", "unigene"), filters = "go",
> values = "GO:0016301"  , mart=ensembl)
> 
> This pulls the usual results, e.g. like EGFR.
> 
> 
> However, it doesn't retrieve results for genes like ATM, which in Gene
> Ontology, is listed as a child of the GO term
> 
> http://amigo.geneontology.org/cgi-bin/amigo/term-assoc.cgi?gptype=all&speciesdb=all&taxid=9606&evcode=all&term_assocs=all&term=GO%3A0016301&session_id=7978amigo1229194926&action=filter
> 
> I would have thought that this query would retrieve genes listed with that
> GO term and the associated children of the GO term.
> 
> On a follow up note, is there a filter by which I can just search "kinase"
> with biomaRt?
> 
> Many thanks,
> Andrew
> 
> 	[[alternative HTML version deleted]]
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Hildebrandt Lab
8220D MSRB III
1150 W. Medical Center Drive
Ann Arbor MI 48109-5646
734-936-8662



More information about the Bioconductor mailing list