[R] Pattern Matching Replacement
Hans-Jörg Bibiko
bibiko at eva.mpg.de
Thu Jun 19 23:40:32 CEST 2008
On 19.06.2008, at 20:17, ppatel3026 wrote:
>
> I would like to replace "\r\n" with "" in a character string, where "\r\n"
> exists only between < and >, how could I do that?
>
> Initial:
> characterString = "<XML><tag1 id=\"F\r\n2\"></t\r\nag1>\r\n<tag\r\n2></tag2></XML>"
>
> Result:
> characterString = "<XML><tag1 id=\"F2\"></tag1>\r\n<tag2></tag2></XML>"
>
> Tried with sub (below) but it only replaces the first instance and I am not
> sure how to pattern match so that it only replaces \r\n that exist within
> tags (< and >).
>
> sub("\r\n", "", charStream)
This is only a first idea:
gsub("(?<=<)([^>]*?)\\r\\n([^>]*?)(?=>)", "\\1\\2", characterString, perl = TRUE)
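
The pattern above consumes everything from just after "<" up to the closing ">", so as far as I can tell it removes at most one "\r\n" per tag in a single pass; a tag such as "<a\r\nb\r\nc>" would keep its second "\r\n". Two possible variants that handle several line breaks per tag are sketched below. Both are only sketches and assume that "<" and ">" never appear outside of tags (for example inside attribute values or text); the names cleaned1, cleaned2 and m are just illustrative.

```r
# Example input from the question: CRLF pairs inside tags and between tags.
characterString <- "<XML><tag1 id=\"F\r\n2\"></t\r\nag1>\r\n<tag\r\n2></tag2></XML>"

# Variant 1: remove a CRLF only if the next angle bracket after it is ">",
# which (under the assumption above) means the CRLF sits inside a tag.
cleaned1 <- gsub("\\r\\n(?=[^<>]*>)", "", characterString, perl = TRUE)

# Variant 2: locate every tag, strip CRLF inside each one, write the tags back.
m <- gregexpr("<[^>]*>", characterString, perl = TRUE)
cleaned2 <- characterString
regmatches(cleaned2, m) <- lapply(
  regmatches(cleaned2, m),
  function(tags) gsub("\r\n", "", tags, fixed = TRUE)
)
```

Variant 1 is a single call; variant 2 is longer but keeps the "find the tags" step separate from the "clean each tag" step, which may be easier to adapt.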
--Hans