[R] problem about set operation and computation after split
R. Michael Weylandt
michael.weylandt at gmail.com
Wed Jun 13 06:43:55 CEST 2012
On Wed, Jun 6, 2012 at 2:43 AM, bestbird <bestbird7788 at gmail.com> wrote:
> hi,
> I met some problems in R, plz help me.
> 1. How to do a intersect operation among several groups in one list, without
> a loop statement? (I think It may be a list)
> create data:
> myData <- data.frame(product = c(1,2,3,1,2,3,1,2,2),
> year=c(2009,2009,2009,2010,2010,2010,2011,2011,2011),value=c(1104,608,606,1504,508,1312,900,1100,800))
> mySplit<- split(myData,myData$year)
> mySplit
> $`2009`
> product year value
> 1 1 2009 1104
> 2 2 2009 608
> 3 3 2009 606
>
> $`2010`
> product year value
> 4 1 2010 1504
> 5 2 2010 508
> 6 3 2010 1312
>
> $`2011`
> product year value
> 7 1 2011 900
> 8 2 2011 1100
> 9 2 2011 800
> I want to get intersection of product between every year. I know the
> basic is:
> intersect(intersect(mySplit[[1]]$product,
> mySplit[[2]]$product),mySplit[[3]]$product)
> this will give the correct answer:
> [1] 1 2
> above code lacks reusability, so It should use a for loop:
> myIntersect<-mySplit[[1]]$product
> for (i in 1:length(mySplit)-1){
> myIntersect<-intersect(myIntersect,mySplit[[i+1]]$product)
> }
> It's correct too, but stll too complex, so my question is:
> Can I do the same thing just use another similar intersect function
> (without for/repeat/while).
> What's this simple function's name ?
Take a look at Reduce()
>
> 2.how to do a relative computation after split (notice: not befor split)?
> create data:
> myData1 <- data.frame(product = c(1,2,3,1,2,3),
> year=c(2009,2009,2009,2010,2010,2010),value=c(1104,608,606,1504,508,1312),relative=0)
> mySplit1<- split(myData1,myData1$year)
> mySplit1
> $`2009`
> product year value relative
> 1 1 2009 1104 0
> 2 2 2009 608 0
> 3 3 2009 606 0
>
> $`2010`
> product year value relative
> 4 1 2010 1504 0
> 5 2 2010 508 0
> 6 3 2010 1312 0
> I want compute relative value in the every group, what I mean is , I want
> get the result is just like below:
> $`2009`
> product year value relative
> 1 1 2009 1104 0
> 2 2 2009 608 -496
> 3 3 2009 606 -2
>
> $`2010`
> product year value relative
> 4 1 2010 1504 0
> 5 2 2010 508 -996
> 6 3 2010 1312 804
> I think to use a loop maybe work, but Is there no direct method on list?
lapply ?
possibly in conjunction with sweep.
>
> 3.how to do a sorting after split, It's just like above question, what I
> want is sorting by value:
> $`2009`
> product year value relative
> 3 3 2009 606 0
> 2 2 2009 608 0
> 1 1 2009 1104 0
> $`2010`
> product year value relative
> 5 2 2010 508 0
> 6 3 2010 1312 0
> 4 1 2010 1504 0
lapply
probably you'll need order for this one.
>
> 4. how to do a filtering after split, Yes, It's just like above quetion,
> what I want is filtering out data which value is more than 1000:
> $`2009`
> product year value relative
> 1 1 2009 1104 0
> $`2010`
> product year value relative
> 4 1 2010 1504 0
> 6 3 2010 1312 0
lapply
which()
and logical indexing.
Michael
>
> --
> View this message in context: http://r.789695.n4.nabble.com/problem-about-set-operation-and-computation-after-split-tp4632496.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list