[BioC] intersection between genes and pathways
John Stevens
john.r.stevens at usu.edu
Thu Mar 11 01:06:52 CET 2010
Hi Alberto,
There may be a more elegant way to do this, but I think you'll want to make use of the rat2302PATH2PROBE object in the rat2302.db package. The key is that you first need the path ID, and that's available through the KEGG.db package. Below is some starter code; I hope it helps.
Regards,
John R. Stevens
Utah State University
# load KEGG pathway information
library(KEGG.db)
# define a function to search for path IDs
# - shouldn't need to change this
KEGGTerm2Tag <- function(term)
{
KTL <- eapply(KEGGPATHID2NAME,
function(x){agrep(term,x,value=TRUE)})
Kl <- sapply(KTL,length); names(KTL[Kl>0])
}
# call this function -- look for path ID of pathways
# whose name contains 'hyptertroph'
KEGG <- KEGGTerm2Tag("hyptertroph")
KEGG
# "05410"
# see full name of this path ID
KEGGPATHID2NAME$"05410"
# "Hypertrophic cardiomyopathy (HCM)"
# get probeset IDs from rat2302 array
# that are in this pathway
library(rat2302.db)
gn <- rat2302PATH2PROBE$"05410"
# Then look at the intersection of this vector
# and the vector of probeset IDs
# from your ExpressionSet object SHR
intersect(gn,featureNames(SHR))
-----Original Message-----
From: bioconductor-bounces at stat.math.ethz.ch [mailto:bioconductor-bounces at stat.math.ethz.ch] On Behalf Of Alberto Goldoni
Sent: Wednesday, March 10, 2010 2:45 AM
To: BioC
Subject: [BioC] intersection between genes and pathways
Dear all,
i have a question for you.
If i have a list of 66 genes:
> SHR
ExpressionSet (storageMode: lockedEnvironment)
assayData: 66 features, 4 samples
element names: exprs
phenoData
sampleNames: SHR-PUFA5.CEL, SHR-PUFA6.CEL, SHR-st7.CEL, SHR-st8.CEL
varLabels and varMetadata description:
DIETA: Con PUFA=OMEGA3 o ST=ALIMENTAZIONE STANDARD
TYPE: WK=RATTO NORMALE o SHR=RATTO IPERTESO featureData
featureNames: 1367871_at, 1368692_a_at, ..., 1399089_at (66 total)
fvarLabels and fvarMetadata description: none
experimentData: use 'experimentData(object)'
Annotation: rat2302
and i would found which genes are involved in the "hypertrophy" and "cardiac decompensation" pathways.
the only way i have found is to list all the pathways with my probe ids:
paths <- aafPathway(probeids, "rat2302.db")
but now i have to look manually to every pathway.
There is a method in order to do automatically, because now i have only 66 genes, but if i have 1000 genes is too complicated.
Best regards.
--
-----------------------------------------------------
Dr. Alberto Goldoni
Bologna, Italy
_______________________________________________
Bioconductor mailing list
Bioconductor at stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/bioconductor
Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list