[R] Tagging identical rows of a matrix
Douglas Bates
bates at stat.wisc.edu
Fri May 14 20:52:59 CEST 2004
Scott Waichler <scott.waichler at pnl.gov> writes:
> I would like to generate a vector having the same length
> as the number of rows in a matrix. The vector should contain
> an integer indicating the "group" of the row, where identical
> matrix rows are in a group, and a unique row has a unique integer.
> Thus, for
>
> a <- c(1,2)
> b <- c(1,3)
> c <- c(1,2)
> d <- c(1,2)
> e <- c(1,3)
> f <- c(2,1)
> mat <- rbind(a,b,c,d,e,f)
>
> I would like to get the vector c(1,2,1,1,2,3). I know dist() gives
> part of the answer, but I can't figure out how to use it for
> this purpose without doing a lot of looping. I need to apply this
> to matrices up to ~100000 rows.
I believe you want to start with unique which, when applied to a
matrix, provides the unique rows.
> unique(mat)
[,1] [,2]
a 1 2
b 1 3
f 2 1
I'm sure others will be able to provide clever ways of doing the
matching against the unique rows.
More information about the R-help
mailing list