[Rd] UTF-8 and .Rd files

Göran Broström goran.brostrom at gmail.com
Tue Jun 27 20:30:52 CEST 2006


On 6/27/06, Spencer Graves <spencer.graves at pdf.com> wrote:
> Hello, Göran:
>
>           Have you considered the German solution:  "Goeran"?  (e.g., Wuertz
> for Würtz)?

Yes, but really not; I like your p.s. solution better!

>           Be thankful that you aren't Russian or Greek or Arabic or Chinese,
> etc., for which there may be no standard transliteration into the Latin
> alphabet.
>
>           Sorry I can't be more helpful.
>
>           Spencer Graves
> p.s.  When I'm with native Spanish speakers who don't know English, I
> pronounce my name very differently, like "Espencer Gra-ve", to match how
> they would pronounce my name when they see it written.  Similarly, I
> once heard a French Canadian take about his young son, Guillaume.  If
> you ask him in English, "What's your name?" he replies, "Bill".  If you
> ask the same question in French, he replies, "Guillaume".

Good idea! I call myself "George" in English, "Yuri" in Russian,
"Goran" on Balkan, etc.

Seriously, I thoght that unicode and utf8 would make problems like
this disappear, but obviously we may have to wait another 30 years.

Thanks for all the input.

George

> Hin-Tak Leung wrote:
> > Göran Broström wrote:
> >> On 6/27/06, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote:
> >>> On Tue, 27 Jun 2006, Göran Broström wrote:
> >>>
> >>>> I have been converting to utf8 from latin1, and this gives me
> >>>> problems, some solved, but here is one unsolved: In my .Rd files, I
> >>>> have included '\encoding{UTF-8}' at the top. Despite this, the HTML
> >>>> help pages contains 'content="text/html; charset=iso-8859-1"', and my
> >>>> name is mangled. What can I do about this?
> >>> Reproducible example, please!  (I've just tried this and it works for me.)
> >>>
> >>> As described in my talk at UseR 2006, you may well not want to do this if
> >>> you intend to distribute the package.  Your name contains characters that
> >>> are not in the fonts used in UTF-8 in non-European locales, and Windows
> >>> users do no have ready access to UTF-8 viewers (even if they know the
> >>> files are UTF-8).
> >> Thanks for your answer! So this means that 'latin1' does not cause
> >> problems for non-European locales and Windows users, I take it.
> >>
> >> I really only need non-ascii to write the name ot the author (me)
> >> correctly. I tried LaTeX code ({\"o}), but that didn't work. Is there
> >> a way around this?
> >>
> >> Göran
> >
> > The \"o character in my latin1 (iso 8859-1) man page says it is 0xF6
> >   F6 - LATIN SMALL LETTER O WITH DIAERESIS
> > The capital version is
> >   D6 -  LATIN CAPITAL LETTER O WITH DIAERESIS
> >
> > in html I think you need to do &#F6; or something for that character to
> > appear?
> >
> > HTH
> >
> > HTL
> >
> > ______________________________________________
> > R-devel at r-project.org mailing list
> > https://stat.ethz.ch/mailman/listinfo/r-devel
>


-- 
Göran Broström



More information about the R-devel mailing list