[Rd] gaussian family change suggestion
Prof Brian Ripley
ripley at stats.ox.ac.uk
Tue Apr 11 16:34:45 CEST 2006
On Tue, 11 Apr 2006, Simon Wood wrote:
> Hi,
>
> Currently the `gaussian' family's initialization code signals an error if
> any response data are zero or negative and a log link is used. Given that
> zero or negative response data are perfectly legitimate under the GLM
> fitted using `gaussian("log")', this seems a bit unsatisfactory. Might
> it be worth changing it?
Well, that's not really true: it says it is unable to find starting
values, and asks you to provide some. I don't really see why your choices
are satisfactory: what happens if all the data are negative for example?
(You get +Inf in the log case, I believe. The algorithm will probably get
stuck from a finite starting point, but it will produce a mysterious error
from an infinite one.)
We could try even harder, but code that is almost never used tends to get
broken whilst no one is testing it. So if you want to pursue it I think
we need a comprehensive test suite.
> The current offending code from `gaussian' is:
>
> initialize = expression({
> n <- rep.int(1, nobs)
> if (is.null(etastart) && is.null(start) && is.null(mustart) &&
> ((family$link == "inverse" && any(y == 0)) ||
> (family$link == "log" && any(y <= 0)))) stop(
> "cannot find valid starting values: please specify some")
> mustart <- y
> })
>
> A possible replacement would be ...
>
> initialize = expression({
> n <- rep.int(1, nobs)
> if (is.null(etastart) && is.null(start) && is.null(mustart) &&
> ((family$link == "inverse" && all(y == 0)) ||
> (family$link == "log" && all(y <= 0)))) stop(
> "cannot find valid starting values: please specify some")
> mustart <- y
> if (family$link=="log") {
> iy <- y<=0
> if (sum(iy)) mustart[iy] <- min(y[!iy])*.5
> } else if (family$link=="inverse") {
> iy <- y==0
> if (sum(iy)) mustart[iy] <- min(abs(y[!iy]))*.5
> }
> })
>
> best,
> Simon
>
>
>> - Simon Wood, Mathematical Sciences, University of Bath, Bath BA2 7AY
>> - +44 (0)1225 386603 www.maths.bath.ac.uk/~sw283/
>
> ______________________________________________
> R-devel at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-devel
>
>
--
Brian D. Ripley, ripley at stats.ox.ac.uk
Professor of Applied Statistics, http://www.stats.ox.ac.uk/~ripley/
University of Oxford, Tel: +44 1865 272861 (self)
1 South Parks Road, +44 1865 272866 (PA)
Oxford OX1 3TG, UK Fax: +44 1865 272595
More information about the R-devel
mailing list