[Rd] unique.default problem (PR#12551)

ripley at stats.ox.ac.uk
Sun Aug 17 11:40:13 CEST 2008

The problem here is that there are more incomparables than elements of x.
But in any case the answer was incorrect, even for a call that returns:

> unique(rep("a", 3), "a")
[1] "a"

I've fixed both issues via a different algorithm for 2.7.2.
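
A minimal sketch, not the code in unique.c and not the algorithm adopted
for 2.7.2, of how a linear-probing loop can be kept from spinning when no
slot is free and no stored entry ever matches. It assumes, in line with the
diagnosis above, a table whose slots have all been taken by incomparables.
The names probe_bounded, EMPTY, BLOCKED and TABLE_FULL are hypothetical.

```c
#include <assert.h>

#define EMPTY      (-1)  /* free slot (hypothetical marker, not R's NIL) */
#define BLOCKED    (-2)  /* slot held by an entry that never compares equal */
#define TABLE_FULL (-3)  /* every slot probed: no match and no free slot */

/*
 * Look up keys[idx] in an open-addressing table of m slots.
 * Slots hold EMPTY, BLOCKED, or a non-negative index into keys[].
 * Returns 1 if an equal key is already stored, 0 if idx was inserted,
 * and TABLE_FULL if m probes found neither a match nor a free slot.
 */
static int probe_bounded(int *slots, int m, const int *keys, int idx)
{
    assert(m > 0 && idx >= 0);
    int i = (int)((unsigned)keys[idx] % (unsigned)m);  /* toy hash */

    /* Bound the probe count by m so a full table cannot loop forever. */
    for (int probes = 0; probes < m; probes++) {
        int s = slots[i];
        if (s == EMPTY) {
            slots[i] = idx;
            return 0;
        }
        if (s >= 0 && keys[s] == keys[idx])
            return 1;
        i = (i + 1) % m;  /* linear probing, as in the quoted loop */
    }
    return TABLE_FULL;
}
```

With two slots that are both BLOCKED, probe_bounded returns TABLE_FULL
after two probes, whereas an unbounded `while (h[i] != NIL)` loop over the
same table would never exit.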

On Sat, 16 Aug 2008, prokaj at cs.elte.hu wrote:

> Full_Name: Vilmos Prokaj
> Version: R 2.7.1
> OS: windows
> Submission from: (NULL) (
> Dear developers,
> The following line of code (produced by a mistake) caused an infinite loop:
> unique("a",c("a","b"))
> or also
> unique(1,1:2)
> I investigated a little, and it seems that the following function
> from unique.c loops infinitely:
> static int isDuplicated(SEXP x, int indx, HashData *d)
> {
>     int i, *h;
>     h = INTEGER(d->HashTable);
>     i = d->hash(x, indx, d);
>     while (h[i] != NIL) {
>         if (d->equal(x, h[i], x, indx))
>             return h[i] >= 0 ? 1 : 0;
>         i = (i + 1) % d->M;
>     }
>     h[i] = indx;
>     return 0;
> }
> In this case h contains only one negative value, which causes d->equal
> (here requal) to return 0:
> static int requal(SEXP x, int i, SEXP y, int j)
> {
>     if (i < 0 || j < 0) return 0;
>     if (!ISNAN(REAL(x)[i]) && !ISNAN(REAL(y)[j]))
>         return (REAL(x)[i] == REAL(y)[j]);
>     else if (R_IsNA(REAL(x)[i]) && R_IsNA(REAL(y)[j])) return 1;
>     else if (R_IsNaN(REAL(x)[i]) && R_IsNaN(REAL(y)[j])) return 1;
>     else return 0;
> }
> I do not claim that the situation above is frequent or even meaningful;
> however, it should not cause a crash of R.
> Sincerely yours
> Vilmos Prokaj

Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595
