[Rd] UTF-8 and .Rd files

Hin-Tak Leung hin-tak.leung at cimr.cam.ac.uk
Tue Jun 27 20:22:21 CEST 2006


Spencer Graves wrote:
> Hello, Göran:
> 
>       Have you considered the German solution:  "Goeran"?  (e.g., Wuertz 
> for Würtz)?
> 
>       Be thankful that you aren't Russian or Greek or Arabic or Chinese, 
> etc., for which there may be no standard transliteration into the Latin 
> alphabet.

Well, I have to live with that, being of one of the above mentioned 
catergories. Where it is important to have my own name in native form
in documents, I keep around a png, a eps with postscript type 1
font embedded, and a pdf from the eps for the odd pdflatex occasions.

It is going to be very hack-ish, but I wonder if it is possible to
utilise the fact that latex comments (%) are not the same as html 
comments (<!-- -->) and vice versa, to make things work.

I seems to recall somewhere in the R-extension manual about being about 
to do \alternatives{latex stuff}{ascii stuff} for alternatives
which are destined to appear in different converted output types.
(Prof Ripley at this point would probably tell me the exact page
number and references...)

Hin-Tak

> 
>       Sorry I can't be more helpful.
> 
>       Spencer Graves
> p.s.  When I'm with native Spanish speakers who don't know English, I 
> pronounce my name very differently, like "Espencer Gra-ve", to match how 
> they would pronounce my name when they see it written.  Similarly, I 
> once heard a French Canadian take about his young son, Guillaume.  If 
> you ask him in English, "What's your name?" he replies, "Bill".  If you 
> ask the same question in French, he replies, "Guillaume".
> 
> Hin-Tak Leung wrote:
>> Göran Broström wrote:
>>> On 6/27/06, Prof Brian Ripley <ripley at stats.ox.ac.uk> wrote:
>>>> On Tue, 27 Jun 2006, Göran Broström wrote:
>>>>
>>>>> I have been converting to utf8 from latin1, and this gives me
>>>>> problems, some solved, but here is one unsolved: In my .Rd files, I
>>>>> have included '\encoding{UTF-8}' at the top. Despite this, the HTML
>>>>> help pages contains 'content="text/html; charset=iso-8859-1"', and my
>>>>> name is mangled. What can I do about this?
>>>> Reproducible example, please!  (I've just tried this and it works 
>>>> for me.)
>>>>
>>>> As described in my talk at UseR 2006, you may well not want to do 
>>>> this if
>>>> you intend to distribute the package.  Your name contains characters 
>>>> that
>>>> are not in the fonts used in UTF-8 in non-European locales, and Windows
>>>> users do no have ready access to UTF-8 viewers (even if they know the
>>>> files are UTF-8).
>>> Thanks for your answer! So this means that 'latin1' does not cause
>>> problems for non-European locales and Windows users, I take it.
>>>
>>> I really only need non-ascii to write the name ot the author (me)
>>> correctly. I tried LaTeX code ({\"o}), but that didn't work. Is there
>>> a way around this?
>>>
>>> Göran
>>
>> The \"o character in my latin1 (iso 8859-1) man page says it is 0xF6
>>   F6 - LATIN SMALL LETTER O WITH DIAERESIS
>> The capital version is
>>   D6 -  LATIN CAPITAL LETTER O WITH DIAERESIS
>>
>> in html I think you need to do &#F6; or something for that character 
>> to appear?
>>
>> HTH
>>
>> HTL
>>
>> ______________________________________________
>> R-devel at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-devel



More information about the R-devel mailing list