[BioC] pathway ID in KEGG.db
Marc Carlson
mcarlson at fhcrc.org
Thu Jul 5 19:07:16 CEST 2012
Hi Ed,
The confusion arises because this is an "apples to oranges" comparison.
KEGGPATHID2NAME holds 390 unique KEGG pathway IDs, so there are only
390 pathway names. In KEGGPATHID2EXTID, however, pathway IDs are matched
up to over 3000 different external gene IDs, which is why the second
mapping is so much larger. You can learn more about the different
mappings by reading their manual pages like this:
help("KEGGPATHID2NAME")
help("KEGGPATHID2EXTID")
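
To see why the two counts need not agree, here is a minimal sketch that uses plain R lists as stand-ins for as.list(KEGGPATHID2NAME) and as.list(KEGGPATHID2EXTID). It does not load KEGG.db. Every key and gene ID in it is invented, and the organism-prefixed keys (hsa, mmu) are an assumption about how the second map is keyed, so check the real keys with names(xx) before relying on it.

```r
# Toy stand-ins for as.list(KEGGPATHID2NAME) and as.list(KEGGPATHID2EXTID).
# Keys and gene IDs are made up; the organism prefixes (hsa, mmu) are an
# assumption about the key format, not something confirmed in this thread.
path2name <- list(
  "00010" = "Glycolysis / Gluconeogenesis",
  "00020" = "Citrate cycle (TCA cycle)"
)
path2ext <- list(
  "hsa00010" = c("geneA", "geneB"),
  "mmu00010" = c("geneC"),
  "hsa00020" = c("geneD", "geneE", "geneF")
)

# length() on a list counts its keys, one per element
n_name_keys <- length(path2name)
n_ext_keys  <- length(path2ext)

# Total number of pathway-to-gene links across all keys
n_ext_links <- sum(lengths(path2ext))

# Strip a leading run of lowercase letters to recover the generic pathway ID
generic_ids <- unique(sub("^[a-z]+", "", names(path2ext)))

# Check that every generic ID has an entry in the name map
all_named <- all(generic_ids %in% names(path2name))
```

In this toy case the name map has 2 keys while the gene map has 3 keys and 6 links, even though both describe the same 2 pathways. Running the same length(), lengths() and names() calls on the real lists should show which of these effects explains 390 versus 3152.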
Marc
On 07/05/2012 12:48 AM, Ed wrote:
> Hi there,
>
> I found that the number of pathways in KEGGPATHID2NAME is 390, while
> that in KEGGPATHID2EXTID is 3152. Am I missing something? BTW, the pathway
> IDs used in this package seem inconsistent too.
>
> Thanks.
>
> Ed
>
> > ?KEGGPATHID2NAME
> > xx <- as.list(KEGGPATHID2NAME)
> > if (length(xx) > 0) {
> +   # Get the value for the first key
> +   xx[[1]]
> +   # Get the values for a few keys
> +   if (length(xx) >= 3) {
> +     xx[1:3]
> +   }
> + }
> $`00010`
> [1] "Glycolysis / Gluconeogenesis"
>
> $`00020`
> [1] "Citrate cycle (TCA cycle)"
>
> $`00030`
> [1] "Pentose phosphate pathway"
>
> > length(xx)
> [1] 390
>
>
> > ?KEGGPATHID2EXTID
> > xx <- as.list(KEGGPATHID2EXTID)
> > if (length(xx) > 0) {
> +   # Get the value of the first key
> +   xx[[1]]
> +   # Get the values for a few keys
> +   if (length(xx) >= 3) {
> +     xx[1:3]
> +   }
> + }
> > length(xx)
> [1] 3152
>
> > sessionInfo()
> R version 2.14.1 (2011-12-22)
> Platform: x86_64-pc-mingw32/x64 (64-bit)
>
> locale:
> [1] LC_COLLATE=Chinese (Simplified)_People's Republic of China.936
> [2] LC_CTYPE=Chinese (Simplified)_People's Republic of China.936
> [3] LC_MONETARY=Chinese (Simplified)_People's Republic of China.936
> [4] LC_NUMERIC=C
> [5] LC_TIME=Chinese (Simplified)_People's Republic of China.936
>
> attached base packages:
> [1] stats graphics grDevices utils datasets methods base
>
> other attached packages:
> [1] KEGG.db_2.6.1 RSQLite_0.11.1 DBI_0.2-5
> [4] AnnotationDbi_1.16.18 Biobase_2.14.0
>
> loaded via a namespace (and not attached):
> [1] IRanges_1.12.5 tools_2.14.1
>