[Rd] UTF-8 and .Rd files
Prof Brian Ripley
ripley at stats.ox.ac.uk
Tue Jun 27 20:22:58 CEST 2006
We describe how to use \enc for possible transliterations for exactly this
purpose in the `Writing R Extensions' manual.
In answer to Göran's question, yes latin1 is safer than UTF-8 for HTML
browsers but neither are guaranteed to contain a glyph for ö in a font
used e.g. in a Russian locale.
On Tue, 27 Jun 2006, Spencer Graves wrote:
> Hello, Göran:
>
> Have you considered the German solution: "Goeran"? (e.g., Wuertz
> for Würtz)?
>
> Be thankful that you aren't Russian or Greek or Arabic or Chinese,
> etc., for which there may be no standard transliteration into the Latin
> alphabet.
>
> Sorry I can't be more helpful.
>
> Spencer Graves
> p.s. When I'm with native Spanish speakers who don't know English, I
> pronounce my name very differently, like "Espencer Gra-ve", to match how they
> would pronounce my name when they see it written. Similarly, I once heard a
> French Canadian take about his young son, Guillaume. If you ask him in
> English, "What's your name?" he replies, "Bill". If you ask the same
> question in French, he replies, "Guillaume".
>
> Hin-Tak Leung wrote:
>> Göran Broström wrote:
>>> On 6/27/06, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote:
>>>> On Tue, 27 Jun 2006, Göran Broström wrote:
>>>>
>>>>> I have been converting to utf8 from latin1, and this gives me
>>>>> problems, some solved, but here is one unsolved: In my .Rd files, I
>>>>> have included '\encoding{UTF-8}' at the top. Despite this, the HTML
>>>>> help pages contains 'content="text/html; charset=iso-8859-1"', and my
>>>>> name is mangled. What can I do about this?
>>>> Reproducible example, please! (I've just tried this and it works for
>>>> me.)
>>>>
>>>> As described in my talk at UseR 2006, you may well not want to do this if
>>>> you intend to distribute the package. Your name contains characters that
>>>> are not in the fonts used in UTF-8 in non-European locales, and Windows
>>>> users do no have ready access to UTF-8 viewers (even if they know the
>>>> files are UTF-8).
>>> Thanks for your answer! So this means that 'latin1' does not cause
>>> problems for non-European locales and Windows users, I take it.
>>>
>>> I really only need non-ascii to write the name ot the author (me)
>>> correctly. I tried LaTeX code ({\"o}), but that didn't work. Is there
>>> a way around this?
>>>
>>> Göran
>>
>> The \"o character in my latin1 (iso 8859-1) man page says it is 0xF6
>> F6 - LATIN SMALL LETTER O WITH DIAERESIS
>> The capital version is
>> D6 - LATIN CAPITAL LETTER O WITH DIAERESIS
>>
>> in html I think you need to do &#F6; or something for that character to
>> appear?
>>
>> HTH
>>
>> HTL
>>
>> ______________________________________________
>> R-devel at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel
>
>
--
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595
More information about the R-devel
mailing list