[R] how to tell if its better to standardize your data matrix first when you do principal
masterinex
xevilgang79 at hotmail.com
Sun Nov 22 19:04:25 CET 2009
so under which cases is it better to standardize the data matrix first ?
also is PCA generally used to predict the response variable , should I
keep that variable in my data matrix ?
Uwe Ligges-3 wrote:
>
> masterinex wrote:
>>
>>
>> Hi guys ,
>>
>> Im trying to do principal component analysis in R . There is 2 ways of
>> doing
>> it , I believe.
>> One is doing principal component analysis right away the other way is
>> standardizing the matrix first using s = scale(m)and then apply
>> principal
>> component analysis.
>> How do I tell what result is better ? What values in particular should i
>> look at . I already managed to find the eigenvalues and eigenvectors ,
>> the
>> proportion of variance for each eigenvector using both methods.
>>
>
> Generally, it is better to standardize. But in some cases, e.g. for the
> same units in your variables indicating also the importance, it might
> make sense not to do so.
> You should think about the analysis, you cannot know which result is
> `better' unless you know an interpretation.
>
>
>
>> I noticed that the proportion of the variance for the first pca without
>> standardizing had a larger value . Is there a meaning to it ? Isnt this
>> always the case?
>> At last , if I am supposed to predict a variable ie weight should I
>> drop
>> the variable ie weight from my data matrix when I do principal component
>> analysis ?
>
>
> This sounds a bit like homework. If that is the case, please ask your
> teacher rather than this list.
> Anyway, it does not make sense to predict weight using a linear
> combination (principle component) that contains weight, does it?
>
> Uwe Ligges
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>
>
--
View this message in context: http://old.nabble.com/how-to-tell-if-its-better-to-standardize-your-data-matrix-first-when-you-do-principal-tp26462070p26466400.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list