[R] "not all duplicated" question
arun
smartpink111 at yahoo.com
Sun Jul 14 03:08:59 CEST 2013
Hi,
May be this helps:
dat1<- read.table(text="
Country, Pet
France, Dog
France, Cat
France, Dog
Canada, Cat
Canada, Cat
Japan, Dog
Japan, Cat
Italy, Cat
",sep=",",header=TRUE,stringsAsFactors=FALSE)
dat1[with(dat1,as.numeric(ave(Pet,Country,FUN=function(x) length(unique(x)))))>1,]
# Country Pet
#1 France Dog
#2 France Cat
#3 France Dog
#6 Japan Dog
#7 Japan Cat
A.K.
----- Original Message -----
From: Vesco Miloushev <vesco.miloushev at gmail.com>
To: r-help at r-project.org
Cc:
Sent: Saturday, July 13, 2013 4:12 PM
Subject: [R] "not all duplicated" question
Hi,
I want to select elements which have duplicates by are not all duplicated.
Here is what I mean. Suppose I have a two column matrix with columns
"Country" and "Pet"
Country, Pet
------------------
France, Dog
France, Cat
France, Dog
Canada, Cat
Canada, Cat
Japan, Dog
Japan, Cat
Italy, Cat
I want to extract all the entries that are duplicated in column
"Country" but not ALL duplicated in column "Pet".
In this case I want
Country, Pet
------------------
France, Dog
France, Cat
France, Dog
Japan, Dog
Japan, Cat
Notice that I keep France, because not all are duplicated. If there
was no entry "France, Cat" then it all of the entries with "France"
would be eliminated.
Thanks for your help.
______________________________________________
R-help at r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list