[R] eliminating a large subset of data from a frame

Uwe Ligges ligges at statistik.uni-dortmund.de
Fri Sep 5 18:16:20 CEST 2003


Peter Whiting wrote:

> I have a data frame with 155,000 rows. One of the columns
> represents the user id (of which about 10,000 are unique).  I am
> able to isolate 1000 of these user ids (stored in a list) that
> I want to eliminate from the data set, but I don't know of an
> efficient way to do this. Certainly this would be slow:
> 
> newdf<-df
> for(i in listofbadusers) {
>  newdf<-subset(tmp,uid!=i)
> }

What about subsetting? See help("[").

One solution (not saying it is the optimal one):

  newdf <- df[!(df$uid %in% listofbadusers), ]

Uwe Ligges

> is there a better approach?
> 
> I guess I could use the opposite logic and use a list of
> good users and add their data to the new frame...
> 
> thanks,
> pete
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help




More information about the R-help mailing list