[R] categorical analysis - grouping rows
cassiorx
cassiodorus at hotmail.com
Sat May 12 23:00:03 CEST 2012
I apologize up front if this has been covered elsewhere - but I can't find
any such question.
I have a data set that contains academic data: term (i.e., semester),
student id, dept, class, success (1=Y, 0=N)
I want to look at dept by term to determine descriptive statistics for
success to failure ratios. The intent being to discover if there are
departments that contribute significantly to the Simpson Paradox, that is,
that make overall success/failure rates undependable.
It's easy to use ftable to get the counts for what I need (row names dept
and success, col name success. So I get something that looks like this:
Term 1st 2nd 3rd 4th 5th
dept success
AAA 0 155 240 163 286 293
1 424 570 349 582 429
AAB 0 55 64 103 46 109
1 122 117 145 112 145
AAC 0 11 3 4 4 4
1 19 12 23 11 7
How can I calculate percentages by dept so that I get
AAA 0 27 ....
1 73 ....
AAB 0 ...
Part of my lack of understanding is that I don't see a way to get the dept
(by term) totals into a data structure that I can use to calculate the
percentages. I can write procedural code to do this but is there some r-way
that would be better?
--
View this message in context: http://r.789695.n4.nabble.com/categorical-analysis-grouping-rows-tp4629503.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list