> I have a script that generates a large number of gene sets (vectors of gene names) and would like to apply a functional analysis (e.g. how many transcription factors occur in each gene set? how many kinases... etc.). I can convert the gene names into different formats using biomaRt, however, the only functional analysis tools I have found in R apply an enrichment analysis on either GO or KEGG gene sets. Is there a package that allows me to answer more simple questions, e.g. just counting the number of transcription factors in a gene set by connecting to a public database?

Hi, Rainer.

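It will depend a bit on the category, but for transcription factors,
for example, I think it suffices to take the genes annotated with
GO:0003700 (transcription factor activity) or with any of its
descendants. The org.Hs.egGO2ALLEGS map does the descendant lookup
for you, since it includes genes annotated to child terms.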


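> library(org.Hs.eg.db)
> # all Entrez Gene IDs known to the GO map
> egs = keys(org.Hs.egGO)
> length(egs)
[1] 45469
> # stand-in for one of your gene sets: 100 random Entrez Gene IDs
> randGenes = sample(egs,100)
> # genes annotated with GO:0003700 or any of its child terms
> tfGenes = get('GO:0003700',org.Hs.egGO2ALLEGS)
> # transcription factors present in the gene set
> intersect(tfGenes,randGenes)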
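
Since you have many gene sets, the same intersection can be applied
to all of them at once. Below is a minimal sketch, not a definitive
implementation: countInCategory is a hypothetical helper name, and
the IDs are made-up toy values so the example runs without
org.Hs.eg.db. In practice tfGenes would come from the get() call
above. The same pattern should work for kinases with, e.g.,
GO:0016301 (kinase activity).

```r
# Toy Entrez-style IDs standing in for the result of
# get('GO:0003700', org.Hs.egGO2ALLEGS); these values are made up.
tfGenes <- c("1", "2", "3", "4")

# Hypothetical gene sets: a named list of character vectors of gene IDs.
geneSets <- list(
  setA = c("1", "2", "10", "11"),
  setB = c("20", "21"),
  setC = c("3", "3", "4", "30")
)

# Hypothetical helper: for each gene set, count the distinct genes
# that also appear in the category (intersect() drops duplicates).
countInCategory <- function(geneSets, categoryGenes) {
  vapply(geneSets,
         function(gs) length(intersect(gs, categoryGenes)),
         integer(1))
}

tfCounts <- countInCategory(geneSets, tfGenes)
print(tfCounts)
```

The result is a named integer vector with one count per gene set, so
it can be used directly in a table or plot. Note that the IDs in your
gene sets must be Entrez Gene IDs to match the org.Hs.eg.db maps.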

