[R] Help summarizing R data frame
jim holtman
jholtman at gmail.com
Thu Dec 2 17:53:37 CET 2010
A nice thing about R is that there are a number of ways to do this:
> x
  identifier quantity
1          1       10
2          1       20
3          2       30
4          1       15
5          2       10
6          3       20
> require(sqldf)
> sqldf('select identifier, sum(quantity) as quantity from x group by identifier')
  identifier quantity
1          1       45
2          2       40
3          3       20
>
Or, using 'data.table':
> require(data.table)
Loading required package: data.table
> x <- data.table(x)
> x[, sum(quantity), by = identifier]
     identifier V1
[1,]          1 45
[2,]          2 40
[3,]          3 20
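
For comparison, the same per-identifier sum can probably be done in base R without any add-on package. This is only a sketch and not part of the original reply: it assumes x is a plain data frame with the columns identifier and quantity shown above, and the names totals and by_id are made up for illustration.

```r
# Rebuild the example data frame from the original post
x <- data.frame(identifier = c(1, 1, 2, 1, 2, 3),
                quantity   = c(10, 20, 30, 15, 10, 20))

# aggregate() with a formula: sum quantity within each identifier,
# returning a data frame with one row per identifier
totals <- aggregate(quantity ~ identifier, data = x, FUN = sum)
print(totals)

# tapply() does the same grouping but returns a named vector
# (names are the identifier values) instead of a data frame
by_id <- tapply(x$quantity, x$identifier, sum)
print(by_id)
```

aggregate() is the closer match if a data frame is wanted back; tapply() is handy when a named vector of totals is enough.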
On Thu, Dec 2, 2010 at 11:24 AM, chris99 <cheakes at hotmail.com> wrote:
>
> I am trying to aggregate data in column 2 to identifiers in col 1
>
> eg..
>
> take this>
>
> identifier quantity
> 1 10
> 1 20
> 2 30
> 1 15
> 2 10
> 3 20
>
> and make this>
>
> identifier quantity
> 1 45
> 2 40
> 3 20
>
>
> Thanks in advance for your help!
--
Jim Holtman
Data Munger Guru
What is the problem that you are trying to solve?