[BioC] Many missing GO terms from GOHyperG call using mgu74av2

John Zhang jzhang at jimmy.harvard.edu
Wed Jun 29 18:20:54 CEST 2005


Yes, ?GO or ?hgu95av2GO ...

>X-Original-To: jzhang at jimmy.harvard.edu
>Delivered-To: jzhang at jimmy.harvard.edu
>Date: Wed, 29 Jun 2005 09:14:25 -0700 (PDT)
>From: Dick Beyer <dbeyer at u.washington.edu>
>To: John Zhang <jzhang at jimmy.harvard.edu>
>Cc: Bioconductor <bioconductor at stat.math.ethz.ch>
>Subject: Re: [BioC] Many missing GO terms from GOHyperG call using mgu74av2
>MIME-Version: 1.0
>X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on  
pascal.dfci.harvard.edu
>X-Spam-Level: 
>X-Spam-Status: No, score=-2.6 required=3.0 tests=BAYES_00 autolearn=ham  
version=3.0.1
>
>Hi John,
>
>Thanks for the good suggestion.  Is there a way to tell what the date of the GO 
information is used in a particular build of an annotation package?  I'm asking 
so I can figure out if doing my own build at any particular time is warranted.
>
>Thanks very much,
>Dick
>*******************************************************************************
>Richard P. Beyer, Ph.D.	University of Washington
>Tel.:(206) 616 7378	Env. & Occ. Health Sci. , Box 354695
>Fax: (206) 685 4696	4225 Roosevelt Way NE, # 100
> 			Seattle, WA 98105-6099
>http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
>http://staff.washington.edu/~dbeyer
>*******************************************************************************
>
>On Wed, 29 Jun 2005, John Zhang wrote:
>
>>
>>> In the course of using GOHyperG, I checked my results against results from
>> NETAFFX and found large discrepancies in the number of GO terms returned for
>> some probeids.
>>
>> Unless we build the annotation packages frequently (every week?), there will 
be
>> descripancies. You may try to build your own annotation packages to minimize 
the
>> descripancies.
>>
>>>
>>> Here is an example code snippet that illustrates my point:
>>>
>>> library(affy)
>>> library(mgu74av2)
>>> library(GOstats)
>>> library(annaffy)
>>> smallList <- c("160102_at")
>>> myLL <- unlist(mget(smallList, mgu74av2LOCUSID))
>>> sum(duplicated(myLL))
>>> length(myLLunique<- !duplicated(mget(smallList, mgu74av2LOCUSID)))
>>> bphyper <- GOHyperG(myLL[myLLunique], lib="mgu74av2", what="BP")
>>>
>>> sort(names(bphyper$go2Affy))
>>> [1] "GO:0006457" "GO:0007582" "GO:0008150" "GO:0008152" "GO:0009987"
>> "GO:0019538"
>>> [7] "GO:0043170" "GO:0050875"
>>>
>>> #From NETAFFX for 160102_at
>>> #"GO:0006457" "GO:0007582" "GO:0008150" "GO:0008152" "GO:0009987" 
"GO:0019538"
>>> #"GO:0043170" "GO:0050875"
>>> #"GO:0044238" "GO:0044237" "GO:0044260" "GO:0044260" "GO:0044267
>>>
>>> Notice that NETAFFX returns 5 additional GO terms that are missing from the
>> results from the GOHyperG call.
>>>
>>> Here is my sessionInfo:
>>>
>>>> sessionInfo()
>>> R version 2.1.0, 2005-04-18, i386-pc-mingw32
>>>
>>> attached base packages:
>>> [1] "splines"   "tools"     "methods"   "stats"     "graphics"
>>> [6] "grDevices" "utils"     "datasets"  "base"
>>>
>>> other attached packages:
>>>    annaffy       KEGG   hgu95av2    GOstats   multtest genefilter
>>>   "1.0.18"    "1.6.8"    "1.8.4"    "1.1.2"    "1.6.0"    "1.6.1"
>>>   survival     xtable       RBGL   annotate         GO        XML
>>>     "2.17"    "1.2-5"    "1.3.7"   "1.5.15"    "1.6.8"   "0.95-6"
>>>      graph      Ruuid    cluster   mgu74av2       affy reposTools
>>>    "1.5.0"    "1.5.0"    "1.9.8"    "1.8.4"    "1.6.7"   "1.5.19"
>>>    Biobase
>>>   "1.5.12"
>>>
>>> Is this a problem with the mgu74av2 metadata, or with GOHyperG?
>>>
>>> Thanks much,
>>> Dick
>>> 
*******************************************************************************
>>> Richard P. Beyer, Ph.D.	University of Washington
>>> Tel.:(206) 616 7378	Env. & Occ. Health Sci. , Box 354695
>>> Fax: (206) 685 4696	4225 Roosevelt Way NE, # 100
>>> 			Seattle, WA 98105-6099
>>> http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
>>> http://staff.washington.edu/~dbeyer
>>>
>>> _______________________________________________
>>> Bioconductor mailing list
>>> Bioconductor at stat.math.ethz.ch
>>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>
>> Jianhua Zhang
>> Department of Medical Oncology
>> Dana-Farber Cancer Institute
>> 44 Binney Street
>> Boston, MA 02115-6084
>>
>>
>

Jianhua Zhang
Department of Medical Oncology
Dana-Farber Cancer Institute
44 Binney Street
Boston, MA 02115-6084



More information about the Bioconductor mailing list