[R] sdev value returned by princomp function (used for PCA)

Sarah Goslee sarah.goslee at gmail.com
Thu Jun 30 22:10:07 CEST 2011


Hi,

What you want is this:

> myPCA.cor <- princomp(scaledUSArrests, cor=TRUE)
> myPCA.cor$sdev
   Comp.1    Comp.2    Comp.3    Comp.4
1.5748783 0.9948694 0.5971291 0.4164494
>

or alternatively

> myPCA2 <- prcomp(scaledUSArrests)
> myPCA2$sdev
[1] 1.5748783 0.9948694 0.5971291 0.4164494

See this message for more detail:

https://stat.ethz.ch/pipermail/r-help/2002-October/025556.html

Sarah

On Thu, Jun 30, 2011 at 3:48 PM, Coghlan, Avril <A.Coghlan at ucc.ie> wrote:
>
> Dear all,
>
> I have a question about the 'sdev' value returned by the princomp function (which does principal components analysis).
>
> On the help page for princomp it says 'sdev' is 'the standard deviations of the principal components'.
>
> However, when I calculate the principal components for the USArrests data set, I don't find this to be the case:
>
> Here is how I calculated the principal components and got the 'sdev' values:
>> scaledUSArrests <- scale(USArrests) # standardise the variables to have variance 1 and mean 0
>> myPCA <- princomp(scaledUSArrests)  # do the PCA
>> myPCA$sdev
>   Comp.1    Comp.2    Comp.3    Comp.4
> 1.5590500 0.9848705 0.5911277 0.4122639
>
> As far as I understand, the principal components themselves are stored in the 'scores' value returned by princomp, so I calculated the standard deviations of those values to see if they agree with 'sdev':
>> sd(myPCA$scores)
>   Comp.1    Comp.2    Comp.3    Comp.4
> 1.5748783 0.9948694 0.5971291 0.4164494
>
> I get different values than I got using sd(myPCA$scores) and myPCA$sdev, why is this?
>
> As there are 4 standardised variables, and the variance of each standardised variable is 1, I would expect the variances of the principal components to add to 1. I find that this is true for the variances calculated using sd(myPCA$scores) but not myPCA$sdev:
>> (1.5590500^2) + (0.9848705^2) + (0.5911277^2) + (0.4122639^2)
>  3.92
>> (1.5748783^2) + (0.9948694^2) + (0.5971291^2) + (0.4164494^2)
>  4.00
>
> I am wondering why are the values in 'sdev' not equal to the standard deviations of the principal components (as stored in 'scores'), and why is the total variance calculated by summing the squared 'sdev' values not equal to 4?
>
> Sorry if I have misunderstood something obvious. I would be grateful for help as I'm a bit confused..
>
> Kind regards,
> Avril
>
> Avril Coghlan
> Cork, Ireland
>

-- 
Sarah Goslee
http://www.functionaldiversity.org



More information about the R-help mailing list