[R-SIG-Finance] Coefficients, Principal Component Regression. pcr

Matthieu Stigler matthieu.stigler at gmail.com
Wed Oct 20 16:57:45 CEST 2010


Dear Karla

I have really limited knowledge in pls and pcr models, but I believe one 
still can have inference in those models, since you are doing a 
regression among different factors.

I don't know exactly to what you are applying your models, but another 
framework very similar to PLS/PCR is Structural Equation Modelling, 
which allows to make different regressions among latent variables (and 
nest factor models, maybe even pcr).

If this is useful for you, you might have a look at the openMx R 
project, which has just done an important release (see annoucement 
below) and seems to be very impressive.

Hope this helps

Matthieu

Dear R users,

The OpenMx developer team takes great pride in announcing the availability
of OpenMx 1.0. The team would like to express its gratitude to the large
number of beta-testers who have helped us improve the code. Thank you.

OpenMx is a free suite of R functions and a estimation back-end that
supports fitting a wide variety of structural equation models including
multiple groups, full information ML for missing data, ordinal estimation
with thresholds, multilevel, latent class, and mixture distributions to name
a few.

The most current version of OpenMx (for Mac, Windows and most Linux
variants) may be downloaded by issuing the following R command:

source('http://openmx.psyc.virginia.edu/getOpenMx.R')

OpenMx is not currently hosted on CRAN due to a license restriction on the
one portion of our code, an optimizer, that was not written by the project.
We hope to remedy this situation reasonably soon as we are working on an
open source version of the optimizer.  The remainder of the project is
licensed under Apache 2.0 and the source code may be downloaded from the
OpenMx website and used for any purpose you wish.  We do hope that you wish
to contribute improvements or bug fixes back to the project, but we do not
require it.

The OpenMx website ishttp://openmx.psyc.virginia.edu, where we host a set
of manuals and tutorials, a wiki, and a set of user forums where issues to
do with OpenMx and SEM in general can be discussed.  We require free
registration in order to post to the forums or wiki so as to slow down the
spambots, but everything else is available without registration.

OpenMx has been in development for 3 years.  The OpenMx beta test program
began in October 2009. Since then, we have improved the interface and sped
up the optimization times considerably. Many new features are in the works
and we encourage you to post to the Wish List forum so that we know what you
would like to see.

> From this point on, we will maintain two binary releases: a stable release
numbered as 1.x.x, and a development binary with the latest and greatest
features and bug-fixes, numbered with the current revision number (e.g.,
1448). The development of OpenMx is on-going and we expect that new
development binaries will be released often. The stable releases are likely
to be updated every few months or so.

For the OpenMx team.


Le 18. 10. 10 01:13, Sarbo a écrit :
> Karla,
>
> The mathematics you've outlined are not quite correct. In multiple
> linear regression you are attempting to find the hyperplane that most
> closely approximates the data set that you are dealing with. In PCA, you
> are performing a change of coordinates so that your data are arranged in
> a different Cartesian space. This means that all of your data can now be
> represented as linear combinations of the eigenvectors calculated from
> your correlation (or covariance) matrix. There is no "approximation"
> going on, hence no t-values or F-statistics to calculate goodness of
> fit. You can, however, figure out the explanatory power of each new
> variable pretty easily.
>
> The significance of each of the basis vectors in your eigenspace is
> represented by the absolute magnitude of each eigenvalue. You can figure
> out how much of your data are represented by each eigenvector by
> plotting the cumulative variance explained by the eigenvectors. The
> function "princomp", which is part of the base R distribution these days
> unless I'm very much mistaken, will help you do this using the "plot"
> method.
>
> On Thu, 2010-10-14 at 08:20 -0600, karla hernandez villafuerte wrote:
>
>    
>> Dear Members:
>>
>> I am trying to applied a Principal Component Regression using the pls package and the next formula:
>>
>> PCA<-pcr(Beta~XCT+XCLa+XCLb+XCLc+XCLd+XCLe+XCLf+XCLg+XCLh,
>> data=mydata,ncomp=15,validation="LOO",method="svdpc", model=TRUE, y=TRUE, x=TRUE)
>>
>> where XCT is a matrix of cuantitatives variables and each one of  XCL are cualitatives variables.
>>
>> I can estimate the coefficient matrix using:
>>
>> coef(PCA,comps=5)
>>
>> As far I understand, the PCR perform a multilple regression analysis of the response variable against the reduced set of principal components, using OLS. The results coefficients are transformed into a new set of coefficients (those in coef(PCA,comps=5)) that correspond to a mathematic transformation of the original ones.
>>
>> My question is: Can I see the t values for the final coefficents estimated in coef(PCA,comps=5) or can I have the results of the OLS equation?. In other case which is the best way to analyse the significance of the variables?
>>
>> Thanks!!
>>
>> Karla Hernndez Villafuerte
>>
>>    		 	   		
>> 	[[alternative HTML version deleted]]
>>
>> _______________________________________________
>> R-SIG-Finance at stat.math.ethz.ch mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-sig-finance
>> -- Subscriber-posting only. If you want to post, subscribe first.
>> -- Also note that this is not the r-help list where general R questions should go.
>>      
>
>
> 	[[alternative HTML version deleted]]
>
> _______________________________________________
> R-SIG-Finance at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-sig-finance
> -- Subscriber-posting only. If you want to post, subscribe first.
> -- Also note that this is not the r-help list where general R questions should go.
>



More information about the R-SIG-Finance mailing list