[R] Pattern Matching Replacement

Hans-Jörg Bibiko bibiko at eva.mpg.de
Thu Jun 19 23:40:32 CEST 2008


On 19.06.2008, at 20:17, ppatel3026 wrote:

>
> I would like to replace "\r\n" with "" in a character string, but only
> where "\r\n" occurs between < and >. How could I do that?
>
> Initial:
> characterString = "<XML><tag1 id=\"F\r\n2\"></t\r\nag1>\r\n<tag\r\n2></tag2></XML>"
>
> Result:
> characterString = "<XML><tag1 id=\"F2\"></tag1>\r\n<tag2></tag2></XML>"
>
> I tried sub (below), but it only replaces the first instance, and I am
> not sure how to write a pattern that only replaces \r\n occurring within
> tags (< and >).
>
> sub("\r\n", "", charStream)

This is only a first idea:

gsub("(?<=<)([^>]*?)\\r\\n([^>]*?)(?=>)", "\\1\\2", characterString, perl = TRUE)
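
As far as I can tell, the pattern above removes at most one "\r\n" per tag in a single pass, because the second group swallows the rest of the tag. That is enough for the example, but not for a tag that contains several line breaks. A minimal sketch of a variant that should handle that case follows. The function name strip_crlf_in_tags is made up for illustration, and the sketch assumes that text between tags never contains a bare ">" character.

```r
# Input from the original question (CRLF inside and between tags).
characterString <- "<XML><tag1 id=\"F\r\n2\"></t\r\nag1>\r\n<tag\r\n2></tag2></XML>"

# The lookbehind/lookahead pattern from above, for comparison.
once <- gsub("(?<=<)([^>]*?)\\r\\n([^>]*?)(?=>)", "\\1\\2",
             characterString, perl = TRUE)

# Hypothetical helper (not from the original post): drop every "\r\n"
# that is followed by a ">" with no "<" or ">" in between, i.e. a
# "\r\n" that sits inside a tag. Assumes no bare ">" in text content.
strip_crlf_in_tags <- function(x) {
  gsub("\\r\\n(?=[^<>]*>)", "", x, perl = TRUE)
}

result <- strip_crlf_in_tags(characterString)

# A tag with two line breaks, plus one line break between tags.
multi <- strip_crlf_in_tags("<a\r\nb\r\nc>\r\n<d>")
```

The lookahead only checks what comes after each "\r\n", so every line break inside a tag is matched on its own, while a "\r\n" between tags is left alone because the next angle bracket after it is "<".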


--Hans


