[BioC] GO annotation inconsistency
Daniel Gatti
dmgatti at email.unc.edu
Fri Aug 4 23:23:22 CEST 2006
O/S: Windows XP
R: 2.3.1
Bioconductor: 1.8
I'm trying to get a list of all probes in a given GO category. In the
Bioconductor annotation libraries there are mapping from GO category to
probe ID and from probe ID to GO category. I'm finding that they do not
match in terms of annotation. Here's a sample script:
library(hgu95av2)
library(GO)
# Get list of probe -> GO mappings.
hgu95av2GO.list = as.list(hgu95av2GO)
hgu95av2GO.list = lapply(hgu95av2GO.list, names)
# Work with GO category 7031.
GO.7031.probes = unique(get("GO:0007031", hgu95av2GO2ALLPROBES))
length(GO.7031.probes)
[1] 16
probe2GO.7031 = hgu95av2GO.list[match(GO.7031.probes,
names(hgu95av2GO.list))]
length(grep("GO:0007031", probe2GO.7031))
[1] 11
Note that the GO -> probe list gives me 16 probes in category 7031 while
the probe -> GO list gives me 11 probes. This happens for a lot of
categories. Am I missing some key concept or is there something else
going on?
Thanks,
Dan Gatti
UNC-CH
More information about the Bioconductor
mailing list