[R] Removing row with smallest value, for a given factor

David Winsemius dwinsemius at comcast.net
Sat Apr 23 20:53:32 CEST 2011


On Apr 23, 2011, at 1:30 PM, Peter Ehlers wrote:

> On 2011-04-23 07:02, David Winsemius wrote:
>>
>> On Apr 23, 2011, at 9:05 AM, - - wrote:
>>
>>> I have a table.
>>> The first column is a date, the second column is an index, and the
>>> other columns contain some other values.
>>> I want to remove, for each date, the row with the smallest index (it
>>> is not necessarily 1).
>>>
>>> ex: in the following table, I want to remove row 1 (2013-05-12 with
>>> index 2) and row 8 (2013-05-13 with index 1)
>>>
>>>            day index values
>>> 1    2013-05-12    2  xxxx
>>> 2    2013-05-12    3  xxxx
>>> 3    2013-05-12    4  xxxx
>>> 4    2013-05-12    5  xxxx
>>> 5    2013-05-12    6  xxxx
>>> 6    2013-05-12    7  xxxx
>>> 7    2013-05-12    8  xxxx
>>> 8    2013-05-13    1  xxxx
>>> 9    2013-05-13    3  xxxx
>>> 10   2013-05-13    4  xxxx
>>> 11   2013-05-13    5  xxxx
>>> 12   2013-05-13    6  xxxx
>>> 13   2013-05-13    7  xxxx
>>> 14   2013-05-13    8  xxxx
>>> 15   2013-05-13    9  xxxx
>>> 16   2013-05-13   10  xxxx
>>> 17   2013-05-13   12  xxxx
>>>
>>
>> Consider using ave and creating a logical vector that you then
>> negate:
>>
>>  >  ave(dat$index, list(dat$day), FUN=function(x) x==min(x))
>>   [1] 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0
>>
>> dat[ -ave(dat$index, list(dat$day), FUN=function(x) x==min(x)), ]
>
> ave() is one of those really handy functions, but I think
> that you meant
>
> dat[ !ave(dat$index, list(dat$day), FUN=function(x) x==min(x)), ]

Yes, that is what I should have answered. Somehow I thought that
because the result was numeric I could use "-", but that would only be
correct if ave() returned row numbers. I'm not sure why it should return
numeric rather than logical, but the help page does say the value will
be numeric, so I shouldn't be surprised, I suppose.
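
To make the difference between "-" and "!" concrete, here is a minimal
sketch. The data frame is rebuilt from the table in the original post,
and the names is_min, wrong and right are only for illustration. As far
as I can tell, ave() assigns the per-group results back into a copy of
its first argument, which would explain why the logical comparison comes
back as 1/0.

```r
# Data rebuilt from the table in the original post ('values' is a placeholder).
dat <- data.frame(
  day    = rep(c("2013-05-12", "2013-05-13"), times = c(7, 10)),
  index  = c(2:8, 1, 3:10, 12),
  values = "xxxx",
  stringsAsFactors = FALSE
)

# 1 where the row holds the smallest index for its day, 0 elsewhere (numeric).
is_min <- ave(dat$index, list(dat$day), FUN = function(x) x == min(x))

# Negative indexing: -is_min contains only -1 and 0. Zeros are ignored and
# -1 drops row 1, so row 8 is wrongly kept.
wrong <- dat[-is_min, ]

# Logical negation: !is_min is TRUE for every row that is not a group minimum,
# so rows 1 and 8 are both dropped.
right <- dat[!is_min, ]

nrow(wrong)
nrow(right)
```

Note that if several rows within a day tie for the smallest index,
x == min(x) flags all of them, so all of them are removed.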


>
> Here's another way, using the plyr package
>
> require(plyr)
> ddply(dat, .(day), .fun = function(x) subset(x, index != min(index)))
>
> Peter Ehlers
>
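
For anyone without plyr installed, the same split-by-day, subset,
recombine idea can be sketched in base R. This is only a rough
equivalent of the ddply() call, not what plyr does internally; the data
frame is again rebuilt from the original post, and the names pieces,
by_split and by_ave are just for illustration.

```r
# Data rebuilt from the table in the original post ('values' is a placeholder).
dat <- data.frame(
  day    = rep(c("2013-05-12", "2013-05-13"), times = c(7, 10)),
  index  = c(2:8, 1, 3:10, 12),
  values = "xxxx",
  stringsAsFactors = FALSE
)

# Split by day, drop the minimum-index row(s) within each piece, recombine.
pieces <- lapply(split(dat, dat$day),
                 function(x) subset(x, index != min(index)))
by_split <- do.call(rbind, pieces)

# The ave() version from earlier in the thread, for comparison.
by_ave <- dat[!ave(dat$index, list(dat$day), FUN = function(x) x == min(x)), ]

# Both approaches keep the same rows; only the row names differ.
all.equal(by_split$index, by_ave$index)
```
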
>> --
>>
>> David Winsemius, MD
>> West Hartford, CT
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>

David Winsemius, MD
West Hartford, CT


