[BioC] GOstats code validation
James W. MacDonald
jmacdon at uw.edu
Mon Sep 30 19:43:06 CEST 2013
On Monday, September 30, 2013 1:36:36 PM, kaushal Raj Chaudhary [guest]
wrote:
>
> Dear List,
>
> I am using your package "GOstats" for my gene expression data. I took seleceted gene id (entrez id) and gene universrse which is all gens in the chip we used (entrez id) form excel file provided by the genome studio software. Here is the code. If you have some available time I want to you to validate the code .
>
>
>
> setwd("W:/KAUSHAL_RESEARCH/K_Surendaran/Data")
>
> source("http://www.bioconductor.org/biocLite.R")
>
> biocLite("AnnotatidatonDbi")
>
> library("AnnotationDbi")
>
> biocLite("illuminaMousev2.db")
>
> library("illuminaMousev2.db")
>
> biocLite("GOstats")
>
> library("GOstats")
>
> biocLite("KEGG.db")
>
> library("KEGG.db")
>
> biocLite("reactome.db")
>
> library("reactome.db")
>
>
>
> data1=read.csv("uni_id.csv", header=T) ### gene universe
>
> data2=data.matrix(data1)
>
> colnames(data2)<-NULL
>
> AllID=as.numeric(data2)
>
> head(AllID)
>
>
>
> [1] 212772 212772 100039693 56378 74257 20510
>
>
> data3=read.csv("selec_ID.csv", header=T) ### selected gene
>
> data4=data.matrix(data3)
>
> colnames(data4)<-NULL
>
> SelecID=as.numeric(data4)
>
> head(SelecID)
>
>
>
> [1] 74257 20322 59010 66398 50995 242864
>
>
> getClass("GOHyperGParams")
>
> getClass("KEGGHyperGParams")
>
>
>
> params<-new("GOHyperGParams", geneIds=SelecID, universeGeneIds=AllID, annotation=c("illuminaMousev2"), ontology="BP", pvalueCutoff=0.01, testDirection="over")
>
> resultBP1<-hyperGTest(params)
>
> resultBP1
>
> summary(resultBP1)
>
> htmlReport(resultBP1, file="ALL_hgo.html")
>
>
>
> Partial output:
> GOBPID Pvalue OddsRatio ExpCount Count Size Term
> GO:0001905 0.000 Inf 0 2 2 activation of membrane attack complex
>
>
>
> I also want to get the genes in the count column.
You can get those genes using probeSetSummary()
Best,
Jim
>
> Thank you very much for help.
>
>
>
>
> -- output of sessionInfo():
>
>> sessionInfo()
> R version 3.0.1 (2013-05-16)
> Platform: x86_64-w64-mingw32/x64 (64-bit)
>
> locale:
> [1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252
> [3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C
> [5] LC_TIME=English_United States.1252
>
> attached base packages:
> [1] parallel stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] xtable_1.7-1 GO.db_2.9.0 reactome.db_1.44.0
> [4] KEGG.db_2.9.1 GOstats_2.26.0 graph_1.38.3
> [7] Category_2.26.0 illuminaMousev2.db_1.18.0 org.Mm.eg.db_2.9.0
> [10] RSQLite_0.11.4 DBI_0.2-7 AnnotationDbi_1.22.6
> [13] Biobase_2.20.1 BiocGenerics_0.6.0 BiocInstaller_1.10.3
>
> loaded via a namespace (and not attached):
> [1] annotate_1.38.0 AnnotationForge_1.2.2 genefilter_1.42.0 GSEABase_1.22.0
> [5] IRanges_1.18.4 RBGL_1.36.2 splines_3.0.1 stats4_3.0.1
> [9] survival_2.37-4 tools_3.0.1 XML_3.98-1.1
>
> --
> Sent via the guest posting facility at bioconductor.org.
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list