[R] problems reading XML type file from ishares website
Jeff Newmiller
jdnewmil at dcn.davis.ca.us
Thu Jul 28 20:55:17 CEST 2016
Er, I failed to include the step to write the repaired data to a file...
fnamenobom <- "nobom.xml"
cat( paste( txt, collapse="\n" ), file=fnamenobom )
xmlfile <- xmlTreeParse( fnamenobom )
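
For anyone wanting the steps in one place, here is a rough sketch that bundles them into a function. The name repair_ishares_xml is made up for illustration and is not from any package. It opens the file through a connection, because as far as I know the encoding argument of readLines() only marks the strings and does not remove the byte order mark, whereas file() with encoding="UTF-8-BOM" does drop it. It assumes the only structural damage is the un-namespaced Style end tag.

```r
# Hypothetical helper (not from any package): strip the BOM, repair the
# mismatched Style end tag, and write the result to a new file.
repair_ishares_xml <- function(fname, fnamenobom = "nobom.xml") {
  # file() with encoding = "UTF-8-BOM" drops a leading byte order mark on read
  con <- file(fname, open = "r", encoding = "UTF-8-BOM")
  on.exit(close(con))
  txt <- readLines(con, warn = FALSE)
  # Replace the first literal "</Style>" on each line with the namespaced tag
  txt <- sub("</Style>", "</ss:Style>", txt, fixed = TRUE)
  # Write the repaired lines; the text goes out in the session's native encoding
  writeLines(txt, fnamenobom)
  invisible(fnamenobom)
}
```

Usage would then be xmlfile <- xmlTreeParse( repair_ishares_xml( fname ) ), with fname being the downloaded file as in the quoted message below. I have only thought about the case where the session encoding can represent everything in the file, so treat it as a starting point.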
--
Sent from my phone. Please excuse my brevity.
On July 28, 2016 11:20:23 AM PDT, Jeff Newmiller <jdnewmil at dcn.davis.ca.us> wrote:
>Please keep the list included in the thread (e.g. reply-all?).
>
>I looked at the file and agree that it looks like xml with a utf8 byte
>order mark and Unix line endings, which means it is not XLS and it is
>not XLSX (which is a zipped directory of xml files with DOS line
>endings). Excel complains but manages to open the file if it has the
>XLS extension, but I am not aware that any of the usual R Excel
>packages will understand this file.
>
>The byte order mark can be addressed by opening the file with
>encoding="UTF-8-BOM", but as you mentioned originally the XML structure
>is still broken (see the error message about the Style ending tag).
>Line 16 seems to use /Style rather than /ss:Style. Maybe
>
>library(XML)
>txt <- readLines( fname, encoding="UTF-8-BOM" )
>txt <- sub( "</Style>", "</ss:Style>", txt )
>fnamenobom <- "nobom.xml"
>xmlfile <- xmlTreeParse( "nobom.xml" )