[R] all duplicated wanted
arun
smartpink111 at yahoo.com
Fri Aug 3 18:50:03 CEST 2012
Hi Weijia,
Try this:
dat1<-read.table(text="
ID AGE GENER LDL
1 25 M 137
1 25 M 125
2 34 F 108
3 30 F 150
3 30 F 147
3 30 F 165
",sep="",header=TRUE,stringsAsFactors=FALSE)
# Match on ID (the patient identifier) so that every row of a repeated ID is kept
subset(dat1,!is.na(match(ID,ID[duplicated(ID)])))
#  ID AGE GENER LDL
#1  1  25     M 137
#2  1  25     M 125
#4  3  30     F 150
#5  3  30     F 147
#6  3  30     F 165
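Another common idiom for the same result, sketched here on the same data, combines duplicated() run forwards and backwards, so that the first occurrence of each repeated ID is flagged as well. The names is_dup and all_dups are only illustrative.

```r
# Same example data as above
dat1 <- read.table(text = "
ID AGE GENER LDL
1 25 M 137
1 25 M 125
2 34 F 108
3 30 F 150
3 30 F 147
3 30 F 165
", header = TRUE, stringsAsFactors = FALSE)

# duplicated() marks every occurrence after the first;
# fromLast = TRUE marks every occurrence before the last.
# Their union marks all rows whose ID occurs more than once.
is_dup <- duplicated(dat1$ID) | duplicated(dat1$ID, fromLast = TRUE)
all_dups <- dat1[is_dup, ]
all_dups
```

This should keep rows 1, 2, 4, 5 and 6 and drop only the single record for ID 2.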
I hope this helps.
A.K.
----- Original Message -----
From: wwang.nyu <wwang.nyu at gmail.com>
To: arun <smartpink111 at yahoo.com>
Cc:
Sent: Friday, August 3, 2012 11:59 AM
Subject: Re: [R] all duplicated wanted
Hi, A.K.
Thanks for your example, that is enlightening.
What I am concerned with is showing all duplicates.
'duplicated' only shows me the repeated rows of each ID, not the first one.
For example, I have these
ID AGE GENER LDL
1 25 M 137
1 25 M 125
2 34 F 108
3 30 F 150
3 30 F 147
3 30 F 165
I want this:
ID AGE GENER LDL
1 25 M 137
1 25 M 125
3 30 F 150
3 30 F 147
3 30 F 165
Instead of this, which 'duplicated' gives me:
ID AGE GENER LDL
1 25 M 125
3 30 F 147
3 30 F 165
Can you help me with this?
Thanks
Weijia Wang
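For reference, a small sketch of what duplicated() does on its own with this data: it flags every occurrence of an ID after the first, which is why the first row of each repeated ID is missing from the result. The object names dat, flag and later_only are only illustrative.

```r
# The data from the message above
dat <- data.frame(
  ID    = c(1, 1, 2, 3, 3, 3),
  AGE   = c(25, 25, 34, 30, 30, 30),
  GENER = c("M", "M", "F", "F", "F", "F"),
  LDL   = c(137, 125, 108, 150, 147, 165)
)

# TRUE for every occurrence of an ID after its first one
flag <- duplicated(dat$ID)
flag

# Rows selected by duplicated() alone: the first row of each ID is left out
later_only <- dat[flag, ]
later_only
```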
On Aug 3, 2012, at 9:20 AM, arun <smartpink111 at yahoo.com> wrote:
>
>
> Hi,
>
> If you want each ID printed only once (the first occurrence of every duplicated ID along with the non-duplicated IDs):
> ID <- sample(1:10, 10, replace=TRUE)
> ID
> # [1] 3 7 5 8 1 5 4 6 7 2
> ID[!duplicated(ID)]
> #[1] 3 7 5 8 1 4 6 2
>
> ID2<-c(4,4,4,3,4,1,2,5,7,4,3,2,5,9,8,12,"A1","A2","A1","B1")
> ID2[!duplicated(ID2)]
> # [1] "4" "3" "1" "2" "5" "7" "9" "8" "12" "A1" "A2" "B1"
> ID2<-c(4,4,4,3,4,1,2,5,7,4,3,2,5,9,8,12,12,1,3,5,8)
> ID2[!duplicated(ID2)]
> # [1] 4 3 1 2 5 7 9 8 12
>
>
> I hope this is what you wanted.
>
> A.K.
>
>
>
>
> ----- Original Message -----
> From: Weijia Wang <wwang.nyu at gmail.com>
> To: r-help at r-project.org
> Cc:
> Sent: Friday, August 3, 2012 3:06 AM
> Subject: [R] all duplicated wanted
>
> Hi,
>
> Has anyone been able to figure out how to print all duplicated observations?
>
> I have a dataset, with patients ID, and other lab records.
>
> Some patients have multiple lab records, but 'duplicated' on ID only shows me the duplicates, not the original observation.
>
> How can I print both the original one and the duplicates?
>
> Thanks
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
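The x[!duplicated(x)] idiom used in the quoted reply keeps the first occurrence of each value, which for a plain vector should be the same as calling unique(x). A minimal sketch, reusing the numeric ID2 vector from that reply (first_only is an illustrative name):

```r
ID2 <- c(4, 4, 4, 3, 4, 1, 2, 5, 7, 4, 3, 2, 5, 9, 8, 12, 12, 1, 3, 5, 8)

# Keep the first occurrence of each value, in order of appearance
first_only <- ID2[!duplicated(ID2)]
first_only

# unique() is expected to give the same vector
identical(first_only, unique(ID2))
```

Neither form returns all rows of a repeated ID; they only drop the repeats.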