[R] unique mismatch in R and Excel

Barry Rowlingson b.rowlingson at lancaster.ac.uk
Wed Dec 25 11:13:40 CET 2013


We answered this on StackOverflow already. Excel was doing
case-insensitive duplicate matching.

http://stackoverflow.com/questions/20759346/counting-unique-values-in-r-and-excel/20759523#20759523

Barry

On Tue, Dec 24, 2013 at 5:43 PM, David Winsemius <dwinsemius at comcast.net> wrote:
>
> On Dec 24, 2013, at 1:08 AM, Koushik Saha wrote:
>
>> i have a wired problem. i want to count the unique entry in a certain
>> column.Here i have attached my csv file.
>
> Files named with extension .csv do not typically make it through the R-help mail server.
>
>>
>> i am doing this to get the unique entries in the column.
>>
>> dat<-read.csv("C:/Project/Gawk-scripts/Book1.csv")
>> names(dat)<-c("user_name")
>> unique(dat$user_name)
>>
>> results says i have 170 unique values.
>>
>>
>> But i am doing "remove duplicate entries"  in excel i am having 147 unique
>> entries in the column.
>>
>> Can anyone explain why there is a mismatch of the results or i am doing
>> something wrong.
>>
>
> Rename the file to have an extension of .txt. Then you mail-client will probably label it correctly as a MIME-TEXT file.
>
> --
> David Winsemius
> Alameda, CA, USA
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list