[BioC] KEGGgraph bug

Marc Carlson mcarlson at fhcrc.org
Tue Dec 8 00:33:45 CET 2009


Hi Fraser,

I believe that David fixed this bug a couple weeks ago.  Please use
biocLite() to update and let us know if there are any further issues.


  Marc





Fraser Sim wrote:
> Hi David,
>
> Thanks. That doesn’t solve the problem though, try to get the KEGG pathway
> ‘hsa03010’ using 
>
>   
>> getCategoryIndepKGMLurl(‘03010’,organism = ‘hsa’)
>>     
>
> It doesn’t produce a valid html link. It should find
>ftp://ftp.genome.jp/pub/kegg/xml/kgml/non-metabolic/organisms/hsa/hsa03010.
> xml’ which is correct but because 'getKGMLurl' returns the invalid
>ftp://ftp.genome.jp/pub/kegg/xml/kgml/non_metabolic/organisms/hsa/hsa03010.
> xml’, it then returns
>ftp://ftp.genome.jp/pub/kegg/xml/kgml/metabolic/organisms/hsa/hsa03010.xml’
> which does not exist. 
>
> Note the substitution of "_" with "-". I’m pretty sure it’s a bug in
> getKGMLurl.
>
> Thanks,
> Fraser
>
> From: Jitao David Zhang [mailto:davidvonpku at gmail.com] 
> Sent: Monday, November 23, 2009 5:58 AM
> To: Fraser Sim
> Cc: bioc
> Subject: Re: [BioC] KEGGgraph bug
>
> Hi Fraser,
>
>   Thanks for the reporting.  The reason for this is that since April 2009
> the KEGG server saves the non-metabolic and metabolic pathways in two
> different sub-directory separatedly, and so far as I know there is no way to
> tell whether the pathway is metabolic alone from the pathway id. Therefore
> 'getKGMLurl' can NOT garantee to return the correct address. However,
> 'retrieveKGML' tries both metabolic and non-metabolic pathway directories,
> therefore it is able to download the file remotely.
>
>   To retrieve the URL for a pathway, use 'getCategoryIndepKGMLurl' function,
> which tries both non-metabolic and metabolic directories. 'getKGMLurl' will
> be hidden from the next release.
>
>   I hope my answer solves your question. 
>
>   Best wishes,
> David
> 2009/11/20 Fraser Sim <fjsim at buffalo.edu>
> I have found a problem with how 'getKGMLurl' constructs the URL for
> retrieving the KGML files. It looks like the '_' should be '-' in
> non_metabolic portion of the generated url. I am using the current release
> versions, R2.10.0 and KEGGgraph 1.2.0.
>
> Thanks,
> Fraser
>
>   
>> library(KEGGgraph)
>>     
> Loading required package: XML
> Loading required package: graph
>
> Attaching package: 'graph'
>
>
>        The following object(s) are masked from package:XML :
>
>         addNode
>
> Loading required package: Rgraphviz
> Loading required package: grid
>   
>> url <- getKGMLurl('04330', organism = "hsa")
>> url
>>     
> [1]
> "ftp://ftp.genome.jp/pub/kegg/xml/kgml/non_metabolic/organisms/hsa/hsa04330.
> xml"
>   
>> parseKGML(url)
>>     
> failed to load external entity
> "ftp://ftp.genome.jp/pub/kegg/xml/kgml/non_metabolic/organisms/hsa/hsa04330.
> xml"
> Error: 1: failed to load external entity
> "ftp://ftp.genome.jp/pub/kegg/xml/kgml/non_metabolic/organisms/hsa/hsa04330.
> xml"
>   
>> parseKGML(gsub("_","-",url))
>>     
> KEGG Pathway
> [ Title ]: Notch signaling pathway
> [ Name ]: path:hsa04330
> [ Organism ]: hsa
> [ Number ] :04330
> [ Image ] :http://www.genome.jp/kegg/pathway/hsa/hsa04330.png
> [ Link ] :http://www.genome.jp/kegg-bin/show_pathway?hsa04330
> ------------------------------------------------------------
> Statistics:
>        30 node(s)
>        16 edge(s)
>        0 reaction(s)
> ------------------------------------------------------------
>   
>> sessionInfo()
>>     
> R version 2.10.0 (2009-10-26)
> i386-pc-mingw32
>
> locale:
> [1] LC_COLLATE=English_United States.1252  LC_CTYPE=English_United
> States.1252    LC_MONETARY=English_United States.1252
> [4] LC_NUMERIC=C                           LC_TIME=English_United
> States.1252
>
> attached base packages:
> [1] grid      stats     graphics  grDevices datasets  utils     methods
> base
>
> other attached packages:
> [1] KEGGgraph_1.2.0  Rgraphviz_1.24.0 graph_1.24.0     XML_2.6-0
> rcom_2.2-1       rscproxy_1.3-1
>
> loaded via a namespace (and not attached):
> [1] tools_2.10.0
>   
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
>
>
>
>



More information about the Bioconductor mailing list