[R] Reshape question.
rkevinburton at charter.net
rkevinburton at charter.net
Wed Mar 11 20:47:03 CET 2009
This hopefully is trivial. I am trying to reshape the data using the reshape package.
First I read in the data:
a2009 <- read.csv("Total2009.dat", header = TRUE)
Then I trim it so that it only contains the columns that I have interested in:
m2009 <- melt(a2009, id.var=c("DayOfYear","Category","SubCategory","Sku"), measure.var=c("Quantity"), na.rm=TRUE)
Then I start to formulate the data that I will process:
c2009 <- cast(m2009, DayOfYear ~ variable | Category, sum)
Finally I aggregate the data:
t2009 <- cast(m2009, DayOfYear ~ variable, sum)
My question is on the third step above (repeated here)
c2009 <- cast(m2009, DayOfYear ~ variable | Category, sum)
This gets the data assocated with a unique 'Category' name. I want to get the data grouped by 'Category' and 'SubCategory'. The 'SubCategory' is not unique but the combination 'Category' and 'SubCategory' form a unique pair. What would be the formula that would give me the data grouped by Category AND SubCategory? Would it be as simple as:
c2009 <- cast(m2009, DayOfYear ~ variable | Category & SubCategory, sum)
?
Thank you for your suggestions.
Kevin
More information about the R-help
mailing list