[R] is there a Decode HTML function somewhere (similar to URLdecode)?
Tony Breyal
tony.breyal at googlemail.com
Thu Jun 24 16:49:07 CEST 2010
Thank you kindly Henrique, that works perfectly for me.
On 24 June 2010 15:25, Henrique Dallazuanna <wwwhsd at gmail.com> wrote:
> Try this:
>
> library(XML)
> xmlValue(getNodeSet(htmlParse(x, asText = TRUE), "//p")[[1]])
>
> On Thu, Jun 24, 2010 at 6:53 AM, Tony Breyal <tony.breyal at googlemail.com>
> wrote:
>>
>> (I've just realised that the example above might not be clear when
>> viewed in a browser)
>>
>> So just to clarify, I would like to convert HTML entities into the
>> ASCII equivalent using R, saving the results to a plain text file.
>>
>> For example:
>>
>> x <- paste("i", "s", "n", "&", "a", "p", "o", "s", ";", "t", sep = "")
>>
>> would become the following, when output to the plain text file:
>>
>> > "isn't"
>>
>>
>> Thank you again for your time,
>> Tony Breyal
>>
>> On 23 June 2010 12:32, Tony B <tony.breyal at googlemail.com> wrote:
>> > Dear all,
>> >
>> > I am aware of the URLdecode(..) function and was wondering if there
>> > was something similar for HTML?
>> >
>> > For example, I would like to convert strings like this:
>> >
>> >> x <- "isn't"
>> >
>> > into this:
>> >
>> >> "isn't"
>> >
>> > Many thanks for your time,
>> > Tony Breyal
>> >
>> > # O/S: Windows Vista 32 bit
>> > # R version 2.11.0 (2010-04-22)
>>
>>
>>
>> --
>> Tony Breyal
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
>
>
> --
> Henrique Dallazuanna
> Curitiba-Paraná-Brasil
> 25° 25' 40" S 49° 16' 22" O
>
--
Tony Breyal
More information about the R-help
mailing list