[R] use of Encoding()?
Olivier Crouzet
olivier.crouzet at univ-nantes.fr
Fri Feb 3 20:51:39 CET 2017
On Fri, 3 Feb 2017 11:23:02 -0800
David Winsemius <dwinsemius at comcast.net> wrote:
Oups, I (erroneously) tried with accented characters, which explains my
answer. Actually, I (correctly) get "unknown" if using characters from
the ASCII set, so my understanding is that there's actually no problem
with the OP's request as there's no reason why "16-03-02" should be
represented as anything else than "unknown" according to this
information (all characters are in the ASCII set).
Olivier.
>
> > On Feb 3, 2017, at 10:12 AM, Olivier Crouzet
> > <olivier.crouzet at univ-nantes.fr> wrote:
> >
> > Hi,
> >
> > using R version 3.3.2 under Linux, these work perfectly (but I
> > receive a correct encoding ("UTF-8"), not "unknown").
> >
> > What is your system (windows, mac, linux)? Your R version? Which
> > interface (RStudio, Windows R interface)? There are often issues
> > with character encoding using Windows (in many different programming
> > languages) but it may not be the case concerning R.
>
> I'm wondering if it's being done on a Mac, since I see the same
> behavior at my console (the "standard" R.app GUI). If the issue is
> with reading a Windows file while using one of the `read.*`
> functions, then setting the `fileEncoding` parameter to one of
> 'iso-8859-1' or 'cp1252' may be attempted.
>
> The ?Encodings page says: "ASCII strings will never be marked with a
> declared encoding, since their representation is the same in all
> supported encodings."
>
> Running the example in the help page (on a Mac):
>
> > x <- "fa\xE7ile"
> > Encoding(x)
> [1] "unknown"
> > Encoding(x) <- "latin1"
> > x
> [1] "façile"
> > Encoding(x)
> [1] "latin1"
>
>
> --
> David.
> >
> > If these operations are meant to read data from a file, you may
> > alternatively consider the option fileEncoding= from read.table /
> > read.csv (to change encoding) or, perhaps but I would
> > suggets first trying the preceding option, encoding= (to
> > specifically declare the file encoding if you know it but R does
> > not detect it).
> >
> > Olivier.
> >
> >
> > On Fri, 3 Feb 2017 17:29:20 +0100 Tilmann Faul
> > <Tilmann_Faul at t-online.de> wrote:
> >
> >> Hey,
> >>
> >> this is my first question here, so forgive me if i my be clumsy.
> >>
> >> I want to use Encoding to set the encoding of a character vector,
> >> but it doese not seem to work. See example.
> >>
> >>> x <- "16-03-02"
> >>> Encoding(x)
> >> [1] "unknown"
> >>> Encoding(x) <- "latin1"
> >>> Encoding(x)
> >> [1] "unknown"
> >>
> >> Is this intended?
> >> Actually i want to change encoding of a character vector generated
> >> by list.file on a linux computerwith UTF-8 file encoding, rstudio
> >> encoding is iso8859-15.
> >> Any hints?
> >>
> >> best Tilmann
> >>
> >> ______________________________________________
> >> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> >> https://stat.ethz.ch/mailman/listinfo/r-help
> >> PLEASE do read the posting guide
> >> http://www.R-project.org/posting-guide.html and provide commented,
> >> minimal, self-contained, reproducible code.
> >
> >
> > --
> > Olivier Crouzet, PhD
> > Laboratoire de Linguistique de Nantes -- UMR6310
> > CNRS / Université de Nantes
> > Chemin de la Censive du Tertre -- BP 81227
> > 44312 Nantes cedex 3
> > France
> >
> > http://www.lling.univ-nantes.fr/
> >
> > ______________________________________________
> > R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> > https://stat.ethz.ch/mailman/listinfo/r-help
> > PLEASE do read the posting guide
> > http://www.R-project.org/posting-guide.html and provide commented,
> > minimal, self-contained, reproducible code.
>
> David Winsemius
> Alameda, CA, USA
>
--
Olivier Crouzet, PhD
Laboratoire de Linguistique de Nantes -- UMR6310
CNRS / Université de Nantes
Chemin de la Censive du Tertre -- BP 81227
44312 Nantes cedex 3
France
http://www.lling.univ-nantes.fr/
More information about the R-help
mailing list