[BioC] GeneSetCollection with GOCollection and ontology argument
Martin Morgan
mtmorgan at fhcrc.org
Sat Nov 24 23:40:18 CET 2012
On 11/16/2012 01:45 AM, Michael Love wrote:
> hi,
>
> I am trying to use the GeneSetCollection function to retrieve the BP GO
> gene sets over some Entrez genes. But it seems like the ontology
> argument of GOCollection is not making a difference to which terms are
> in the resulting GeneSetCollection. In the help for GOCollection it
> says " Unlike evidence code, ontology membership is enforced when
> GOCollection gene sets are constructed." so I expected that the
> resulting object would only have BP GO terms.
Hi Michael -- sorry for the delayed response. This has been addressed in release
v. 1.20.1 and devel version v. 1.21.1; devel requires graph >= 1.37.2) and
R-devel. These are available from svn now, via biocLite all being well Sunday
after 10am Seattle time.
I'm interested in updating GSEABase so bug reports and feedback (especially on
the devel version) welcome; there are some serious performance issues that
unfortunately require a substantial revision.
Martin
>
> thanks for any guidance,
>
> a small example for the Entrez genes 1-20:
>
> library(org.Hs.eg.db)
> library(GSEABase)
> gsc <- GeneSetCollection(as.character(1:20),
> idType=AnnotationIdentifier("org.Hs.eg.db"),
> setType=GOCollection(ontology="BP"))
> table(unlist(eapply(GOTERM[names(gsc)],slot,"Ontology")))
>
> BP CC MF
> 101 26 44
>
> > sessionInfo()
> R version 2.15.0 (2012-03-30)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
> [7] LC_PAPER=C LC_NAME=C
> [9] LC_ADDRESS=C LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] GO.db_2.7.1 GSEABase_1.20.0 graph_1.36.1
> [4] annotate_1.34.0 org.Hs.eg.db_2.8.0 RSQLite_0.11.1
> [7] DBI_0.2-5 AnnotationDbi_1.20.2 Biobase_2.18.0
> [10] BiocGenerics_0.4.0 BiocInstaller_1.8.3
>
> loaded via a namespace (and not attached):
> [1] IRanges_1.16.4 parallel_2.15.0 stats4_2.15.0 tools_2.15.0
> [5] XML_3.9-4 xtable_1.7-0
>
> ...also I tried for:
>
> > sessionInfo()
> R Under development (unstable) (2012-11-01 r61067)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
> [7] LC_PAPER=C LC_NAME=C
> [9] LC_ADDRESS=C LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] parallel stats graphics grDevices utils datasets methods
> [8] base
>
> other attached packages:
> [1] GO.db_2.8.0 org.Hs.eg.db_2.8.0 RSQLite_0.11.2
> [4] DBI_0.2-5 GSEABase_1.21.0 graph_1.37.1
> [7] annotate_1.37.1 AnnotationDbi_1.21.5 Biobase_2.19.0
> [10] BiocGenerics_0.5.1 Defaults_1.1-1 BiocInstaller_1.9.4
>
> loaded via a namespace (and not attached):
> [1] IRanges_1.17.13 stats4_2.16.0 tools_2.16.0 XML_3.95-0.1
> [5] xtable_1.7-0
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>
--
Computational Biology / Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N.
PO Box 19024 Seattle, WA 98109
Location: Arnold Building M1 B861
Phone: (206) 667-2793
More information about the Bioconductor
mailing list