[R] compositional data: percent values sum up to 1
Spencer Graves
spencer.graves at pdf.com
Mon Jun 2 21:19:25 CEST 2003
Hi, Christoph:
Andy Liaw's suggestion sounds sensible to me, though I don't have
much experience with this kind of data either.
OTHER QUESTIONS:
* How big is J? I'm guessing it might be quite large, but I don't
know.
* Are the spectra relatively smooth? I wonder if it might be
appropriate to try to smooth the data some way preliminary to other
analyses.
* How many observations do you have in each of "ill" and "healthy",
especially relative to J?
I might try to do a principal components analysis (or "svd" if
"princomp" bombed because of singular matrices) on the covariance matrix
of the spectra. Then I might want to test how different the spectra were.
hope this helps. spencer graves
Liaw, Andy wrote:
> Eh? The original message says it's the design matrix that is perfectly
> collinear after the transformation, not the response.
>
> I don't know much about this type of data, but seems like you could just fit
> the model w/o intercept to eliminate the collinearity, no? It's the
> interpretation of the result that may be tricky, I think.
>
> Andy
>
>
>
>>-----Original Message-----
>>From: Spencer Graves [mailto:spencer.graves at pdf.com]
>>Sent: Monday, June 02, 2003 9:33 AM
>>To: Christoph Lehmann
>>Cc: Spencer Graves; r-help at stat.math.ethz.ch
>>Subject: Re: [R] compositional data: percent values sum up to 1
>>
>>
>>"glm" will do multinomial logistic regression. However, if J
>>is large,
>>I doubt if that will do what you want. If it were my
>>problem, I might
>>feel a need to read the code for "glm" and modify it to do
>>what I want.
>> Perhaps someone else can suggest something better.
>>
>>hth. spencer graves
>>
>>Christoph Lehmann wrote:
>>
>>>I want to do a logistic regression analysis, and to compare with, a
>>>discriminant analysis. The mentioned power maps are my
>>
>>exogenous data,
>>
>>>the dependent variable (not mentioned so far) is a diagnosis
>>>(ill/healthy)
>>>
>>>thanks for the interest and the help
>>>
>>>Christoph
>>>
>>>On Sun, 2003-06-01 at 21:01, Spencer Graves wrote:
>>>
>>>
>>>>What are you trying to do? What I would do with this
>>>
>>depends on many
>>
>>>>factors.
>>>>
>>>>spencer graves
>>>>
>>>>Christoph Lehmann wrote:
>>>>
>>>>
>>>>>again, under another subject:
>>>>>sorry, maybe an all too trivial question. But we have
>>>>
>>power data from J
>>
>>>>>frequency spectra and to have the same range for the data
>>>>
>>of all our
>>
>>>>>subjects, we just transformed them into % values, pseudo-code:
>>>>>
>>>>>power[i,j]=power[i,j]/sum(power[i,1:J])
>>>>>
>>>>>of course, now we have a perfect linear relationship in
>>>>
>>our x design-matrix,
>>
>>>>>since all power-values for each subject sum up to 1.
>>>>>
>>>>>How shall we solve this problem: just eliminate one column of x, or
>>>>>introduce a restriction which says exactly that our power
>>>>
>>data sum up to
>>
>>>>>1 for each subject?
>>>>>
>>>>>Thanks a lot
>>>>>
>>>>>Christoph
>>>>
>>>>______________________________________________
>>>>R-help at stat.math.ethz.ch mailing list
>>>>https://www.stat.math.ethz.ch/mailman/listinfo/r-help
>>>
>>______________________________________________
>>R-help at stat.math.ethz.ch mailing list
>>https://www.stat.math.ethz.ch/mailman/listinfo/r-help
>>
>
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help
More information about the R-help
mailing list