[R] [OT] correspondence analysis w/ non-mutually-exclusive ca tegories
Khawaja Marwan
Marwan.Khawaja at fafo.no
Fri Mar 2 10:12:04 CET 2001
Andy,
Take a look at Greenacre, Theory and Applications of Correspondence
Analysis. He has many example of dealing with all sorts of data.
Basically, the technique is relevant for 2-way tables -- MCA is an
extenstion. It is not clear in your example whether CA is really
appropriate -- you want to make an observation (if at all possible) fall in
one cell, treating the others layers as 'supplementary' points -- without
necessarily contributing to the 'factors.' So the first step is to cross
classify your data into a table and then apply CA.
Marwan Khawaja
Fafo Institute
Borggt. 2B, P.O.Box 2947 Tøyen
N-0608 Oslo, Norway
> -----Original Message-----
> From: Andrew Perrin [SMTP:aperrin at socrates.berkeley.edu]
> Sent: 1. mars 2001 21:50
> To: r-help at r-project.org
> Subject: [R] [OT] correspondence analysis w/ non-mutually-exclusive
> categories
>
> Greetings, again. This is not strictly an R question, so please feel free
> to ignore it if you like.
>
> My question is about the substance of correspondence
> analysis. Specifically, is it appropriate to use ca on a matrix of values
> such that the columns and/or rows are not mutually exclusive? To be more
> detailed:
>
> - The standard use of ca is illustrated in the example of corresp() (from
> MASS):
>
> data(caith)
> library(mva)
> corresp(caith)
> biplot(corresp(caith, nf=2))
>
> > caith
> fair red medium dark black
> blue 326 38 241 110 3
> light 688 116 584 188 4
> medium 343 84 909 412 26
> dark 98 48 403 681 85
>
> in this table, presumably, an observation can fall in only one
> cell: red/light, say, or dark/fair.
>
> - However, my data are different, in that a single observation can
> (theoretically) fall in two or more cells. Consider:
>
> voted98 voted00 donated protested no_partic
> male
> female
>
> a given observation might fall, for example, in male/voted98 and
> male/voted00. What are the implications of this?
>
> - I am aware of the multiple correspondence technique, which I believe
> answers (some of) this issue. However I have a different problem with
> it: I have so many observations (ca. 5700) that the plot becomes
> unreadable. That's because each *observation* is plotted in mca, whereas
> each unique profile is what's plotted in ca.
>
> Any advice will be met with tremendous gratitude :)
>
> Andy Perrin
>
> ----------------------------------------------------------------------
> Andrew J Perrin - Ph.D. Candidate, UC Berkeley, Dept. of Sociology
> Chapel Hill, North Carolina, USA - http://demog.berkeley.edu/~aperrin
> aperrin at socrates.berkeley.edu - aperrin at igc.apc.org
>
> -.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.
> -.-.-
> r-help mailing list -- Read
> http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
> Send "info", "help", or "[un]subscribe"
> (in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
> _._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._.
> _._._
-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
Send "info", "help", or "[un]subscribe"
(in the "body", not the subject !) To: r-help-request at stat.math.ethz.ch
_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._
More information about the R-help
mailing list