localeToCharset {utils}    R Documentation
Select a Suitable Encoding Name from a Locale Name
Description
This function aims to find a suitable encoding for the named locale (by default the current locale) and, if it is a UTF-8 locale, a suitable single-byte encoding as well.
Usage
localeToCharset(locale = Sys.getlocale("LC_CTYPE"))
Arguments
locale    character string naming a locale.
Details
The operation differs by OS.
- On Windows, a locale is specified like "English_United Kingdom.1252". The final component gives the codepage, and this defines the encoding.
- On Unix-alikes, locale names are normally like es_MX.iso88591. If the final component indicates an encoding and it is not utf8, we just need to look up the equivalent encoding name. Otherwise, the language (here es) is used to choose a primary or fallback encoding.
In the C locale the answer will be "ASCII".
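As a rough illustration of the two naming styles described above, the sketch below passes an explicit locale name rather than relying on the current locale. The two names are the ones quoted in the text; the variable names are only illustrative, and the result for the platform-specific name is printed rather than assumed.

```r
# Locale names in the two styles described above (both taken from the text).
win_name  <- "English_United Kingdom.1252"  # Windows style: codepage suffix
unix_name <- "es_MX.iso88591"               # Unix-alike style: encoding suffix

# Use the style that matches the platform R is running on.
nm  <- if (.Platform$OS.type == "windows") win_name else unix_name
enc <- utils::localeToCharset(nm)
print(enc)

# The C locale is documented to give "ASCII".
c_enc <- utils::localeToCharset("C")
print(c_enc)
```

Passing a name in the other platform's style is not expected to be meaningful, which is why the sketch selects the name by OS type.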
Value
A character vector naming an encoding and possibly a fallback single-byte encoding, or NA if unknown.
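Because the result may have one or two elements and may contain NA, callers usually need to unpack it defensively. A minimal sketch, assuming the first element is treated as the primary encoding and any second element as the fallback (the names primary and fallback are illustrative only):

```r
encs <- utils::localeToCharset()   # uses the current LC_CTYPE locale

# First element: primary encoding; second element, if present: fallback.
primary  <- encs[1L]
fallback <- if (length(encs) > 1L) encs[2L] else NA_character_

if (is.na(primary)) {
  message("No encoding could be determined for this locale.")
} else {
  message("Primary encoding: ", primary)
  if (!is.na(fallback)) message("Fallback encoding: ", fallback)
}
```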
Note
The encoding names are those used by libiconv, and ought also to work with glibc, but maybe not with commercial Unixen.
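Since the returned name may not be accepted by every iconv implementation, one way to check it on a given system is to attempt a trivial conversion. This is only a sketch: the helper name iconv_accepts is hypothetical, and a successful conversion of a short ASCII string is taken as a reasonable sign that the name is recognised.

```r
# Hypothetical helper: TRUE if iconv() on this system accepts `enc` as a
# source encoding name, FALSE if it is NA or the conversion fails.
iconv_accepts <- function(enc) {
  if (is.na(enc)) return(FALSE)
  res <- tryCatch(iconv("abc", from = enc, to = "UTF-8"),
                  error = function(e) NA_character_)
  !is.na(res)
}

enc <- utils::localeToCharset()[1L]
ok  <- iconv_accepts(enc)
print(ok)
```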
See Also
Sys.getlocale, iconv.
Examples
localeToCharset()