[Rd] How to get utf8 string using R externals

David Winsemius dw|n@em|u@ @end|ng |rom comc@@t@net
Thu Jun 3 00:09:14 CEST 2021


First; you should configure yopu mail client to send plain text.

Can you explain what is meant by:

the characters are unicodes (<U+BD80><U+C2E4>) instead of
utf8 encoding of the korean characters 부실.

As far as I can tell those two unicodes _are_ the utf8 encodings of 부실.

You may need to consult a couple of R help pages. I suggest:

?Quotes
?points   # has examples of changing fonts used for display on console.

Sorry if I've misunderstood. I'm not on a Windows device, so  posting the C++ program won't be helpful, but maybe it would for other prospective respondents.

-- 
David.

On 6/2/21 1:33 PM, xiaoyan yu wrote:
> I have a R Script Predict.R:
>      set.seed(42)
>      C <- seq(1:1000)
>      A <- rep(seq(1:200),5)
>      E <- (seq(1:1000) * (0.8 + (0.4*runif(50, 0, 1))))
>      L <- ifelse(runif(1000)>.5,1,0)
>      df <- data.frame(cbind(C, A, E, L))
> load("C:/Temp/tree.RData")                #  load the model for scoring
>
>    P <- as.character(predict(tree_model_1,df,type='class'))
>
> Then in a C++ program
> I call eval to evaluate the script and then findVar the P variable.
> After get each class label from P using string_elt and then
> Rf_translateChar, the characters are unicodes (<U+BD80><U+C2E4>) instead of
> utf8 encoding of the korean characters 부실.
> Can I know how to get UTF8 by using R externals?
>
> I also found the same script giving utf8 characters in RGui but unicode in
> Rterm.
> I tried to attach a screenshot but got message "The message's content type
> was not explicitly allowed"
> In RGui, I saw the output 부실, while in Rterm, <U+BD80><U+C2E4>.
>
> Please help.
>
> 	[[alternative HTML version deleted]]
>
> ______________________________________________
> R-devel using r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel



More information about the R-devel mailing list