[BioC] problem with GO annotation data
Nianhua Li
nli at fhcrc.org
Fri Jul 28 03:16:54 CEST 2006
Hi, Shiliang,
You were right that AnnBuilder did have a bug. Thanks for the report!
The bug is related to Weijun's question:
http://thread.gmane.org/gmane.science.biology.informatics.conductor/9227/focus=9232
When baseType is "ll", the baseFile can't be parsed correctly because
some parsers assume Entrez Gene is at the 3rd column of the base file.
Thanks for John's patch, AnnBuilder just quietly converts the base file
into three columns when baseType is "ll". So, weijun's problem got
solved. Unfortunately, the parser used for "go2probe" mapping assumes a
two-column setting for baseType "ll", therefore can't work correctly
with the patch.
Now the problem has been solved. You can either get the up-to-date
AnnBuilder (v 1.11.6) from svn or wait still Saturday noon to download
it using biocLite or from website. You can also modify your local source
AnnBuilder/R/ABPkgBuilder.R if you are in a hurry:
===================================================================
--- ABPkgBuilder.R (v1.11.5 or before)
+++ ABPkgBuilder.R (v1.11.6)
# Map GO ids to probe ids that are directly associated with the GO ids
mapGO2Probe <- function(eg, baseMapType){
- if(baseMapType == "ll"){
- parser(eg) <- file.path(.path.package("AnnBuilder"),
- "scripts", "GO2ProbeParser4LL")
- }else{
- parser(eg) <- file.path(.path.package("AnnBuilder"),
+ parser(eg) <- file.path(.path.package("AnnBuilder"),
"scripts", "GO2ProbeParser")
- }
options(show.error.messages = FALSE)
go2Probe <- try(parseData(eg, eg at go, ncol = 3))
options(show.error.messages = TRUE)
====================================================================
Also the probe number mismatch is because your base file has duplicated
probeset identifiers.
help it works this time :)
nianhua
More information about the Bioconductor
mailing list