[R] Contional
jim holtman
jholtman at gmail.com
Tue Sep 23 23:48:10 CEST 2008
Is this what you want: TRUE marks the ones to be removed
> mark <- (head(x$timestamp, -1) == tail(x$timestamp, -1)) &
+ (head(x$user_id, -1) == tail(x$user_id, -1))
> x$flag <- c(FALSE, mark)
> x
Source_type timestamp user_id flag
75381 0 07-07-2008-21:03:55 848307909687 FALSE
75379 1 07-07-2008-19:52:55 848307838407 FALSE
75380 2 07-07-2008-19:54:14 848307838407 FALSE
75378 1 07-07-2008-15:24:01 848285633277 FALSE
75374 1 07-07-2008-13:39:17 848273633667 FALSE
75377 2 07-07-2008-13:39:55 848273633667 FALSE
75376 2 07-07-2008-13:39:55 848273633667 TRUE
75375 2 07-07-2008-13:56:05 848273633667 FALSE
75373 1 07-07-2008-17:11:00 848272661427 FALSE
75371 1 07-07-2008-13:19:00 848270431847 FALSE
75372 2 07-07-2008-13:19:14 848270431847 FALSE
75369 1 07-07-2008-12:49:16 848269676907 FALSE
75370 2 07-07-2008-12:49:16 848269676907 TRUE
75366 1 07-07-2008-13:29:15 848263484847 FALSE
75368 2 07-07-2008-13:29:44 848263484847 FALSE
>
On Tue, Sep 23, 2008 at 4:44 PM, Michael Pearmain <mpearmain at google.com> wrote:
> Hi All,
>
> I'm having trouble selecting rows to delete, that i can't seem to overcome.
>
> Below is some sample data, i am trying to dedup the data based on each user,
> and simultaneously the timestamp (at the side i have highlighted expected
> row to be removed)
>
> I've looked at the lag function but can't seem to make it work?
>
> My logic ran along the lines of an ifelse statement and then remove after
> that, but it doesn't seem to work? Any help appreciated
>
> Let's call the data test
>
> test$lag <- ifelse(test$user_id==lag(test$user_id)
> & test$timestamp==lag(test$timestamp),1,0)
>
> Can anyone help on this?
>
> Mike
>
>
>
> Source_type timestamp user_id
> 75381 0 07-07-2008-21:03:55 848307909687
> 75379 1 07-07-2008-19:52:55 848307838407
> 75380 2 07-07-2008-19:54:14 848307838407
> 75378 1 07-07-2008-15:24:01 848285633277
> 75374 1 07-07-2008-13:39:17 848273633667
> 75377 2 07-07-2008-13:39:55 848273633667
> 75376 2 07-07-2008-13:39:55 848273633667 Remove
> 75375 2 07-07-2008-13:56:05 848273633667
> 75373 1 07-07-2008-17:11:00 848272661427
> 75371 1 07-07-2008-13:19:00 848270431847
> 75372 2 07-07-2008-13:19:14 848270431847
> 75369 1 07-07-2008-12:49:16 848269676907 Remove
> 75370 2 07-07-2008-12:49:16 848269676907
> 75366 1 07-07-2008-13:29:15 848263484847
> 75368 2 07-07-2008-13:29:44 848263484847
>
> Thanks in advance
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
--
Jim Holtman
Cincinnati, OH
+1 513 646 9390
What is the problem that you are trying to solve?
More information about the R-help
mailing list