[R] replacing unicode characters

Iris Simmons |kw@|mmo @end|ng |rom gm@||@com
Thu Jun 29 00:58:57 CEST 2023


Hiya!


You can do this by specifying sub="c99" instead of "Unicode":

```R
x <- "fa\xE7ile"
xx <- iconv(x, "latin1", "UTF-8")
iconv(xx, "UTF-8", "ASCII", "c99")
```

produces:

```
> x <- "fa\xE7ile"
> xx <- iconv(x, "latin1", "UTF-8")
> iconv(xx, "UTF-8", "ASCII", "c99")
[1] "fa\\u00e7ile"
>
```

For future reference, you can find this in section Examples of the
help page ?iconv
I hope this helps!

On Wed, Jun 28, 2023 at 3:09 PM Adrian Dușa <dusa.adrian using gmail.com> wrote:
>
> Dear list,
>
> Building on the example from ?iconv:
> x <- "fa\xE7ile"
> xx <- iconv(x, "latin1", "UTF-8") # "façile"
>
> and:
>
> iconv(xx, "UTF-8", "ASCII", "Unicode")
> # "fa<U+00E7>ile"
>
> This is the type of result I sometimes get from an R script that I cannot
> reproduce here, because it depends on a terminal process started in a
> compiled Electron (Node.js) application, under MacOS.
>
> I was wondering, is there a standard way, perhaps also using iconv(), to
> convert this type of result to a more manageable unicode representation?
>
> Something like: "fa\u00e7ile"
>
> Or perhaps a clever regexp, for any number of such occurrences in a string?
>
> Thanks a lot in advance,
> Adrian
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list