[R] Identifying duplicate rows?
Rui Barradas
ruipbarradas at sapo.pt
Mon Sep 10 21:13:23 CEST 2012
Hello,
Please use dput(dfA) to post your data examples. The following is it's
output. All one has to do is to copy and paste to an R session to get
the data example.
dfA <-
structure(list(Site = c(929L, 929L, 930L, 930L, 931L, 932L),
State = structure(c(1L, 1L, 1L, 1L, 1L, 1L), .Label = "VA", class =
"factor"),
Value = c(73L, 73L, 76L, 76L, 74L, 75L)), .Names = c("Site",
"State", "Value"), class = "data.frame", row.names = c(NA, -6L))
# Now use the argument 'fromLast'
dfA$dups <- duplicated(dfA) | duplicated(dfA, fromLast = TRUE)
Hope this helps,
Rui Barradas
Em 10-09-2012 19:23, kborgmann escreveu:
> Hi,
> I am trying to identify duplicate values in a column in a date frame. The
> duplicated function identifies the duplicate rows in the data frame but it
> only does this for the second record, not both records. Is there a way to
> mark both rows in the data frame as TRUE?
> dfA$dups<-duplicated(dfA$Value)
> dfA
> Site State Value dups
> 929 VA 73 FALSE
> 929 VA 73 TRUE
> 930 VA 76 FALSE
> 930 VA 76 TRUE
> 931 VA 74 FALSE
> 932 VA 75 FALSE
>
> But I would like this
> Site State Value dups
> 929 VA 73 TRUE
> 929 VA 73 TRUE
> 930 VA 76 TRUE
> 930 VA 76 TRUE
> 931 VA 74 FALSE
> 932 VA 75 FALSE
>
> Thank you for your replies!
> Still on the learning curve,
> Kathi
>
>
>
> --
> View this message in context: http://r.789695.n4.nabble.com/Identifying-duplicate-rows-tp4642679.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list