[R] delete duplicated from data.frame
Uwe Ligges
ligges at statistik.uni-dortmund.de
Tue May 18 16:39:59 CEST 2004
Christian Schulz wrote:
> Hi,
>
> ?unique
> unique returns a vector, data frame or array like x * but with duplicate
> elements removed *
>
> what i'm doing wrong delete duplicated rows with same MEMEBRNO.
>
> februar <- dmsegment[unique(dmsegment$MEMBERNO),]
unique() returns values, but not indices nor logicl values.
Try instead:
februar <- dmsegment[!duplicated(dmsegment$MEMBERNO),]
Uwe Ligges
> This reduce from 197.188 rows to 184.199 but duplicated MEMBERNO didn't left
> all what a Primary Key setting in mysql me say and with a fix(februar) could
> recognize .
>
> Courious why MEMEBRNO 4,5 ,6 and 11 are left !
> dmsegment$MEMBERNO[1:10]
> [1] 1 4 5 6 7 9 10 11 16 21
>
> februar$MEMBERNO[1:10]
> [1] 1 6 7 9 10 16 21 26 53 72
>
> Using unique with a single vector it works like i expect.
>
>
> P.S.
> i try -duplcated but get not better succes?
>
> Many Thanks,
> Christian
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
More information about the R-help
mailing list