[R] Penalized Canonical Variates

Frank E Harrell Jr f.harrell at vanderbilt.edu
Thu Dec 21 15:50:27 CET 2006


In the excellent paper by Hastie, Buja, and Tibshirani "Penalized 
Discriminant Analysis" the authors developed penalized discriminant 
functions that incorporated shrinkage on the predictor parameters.  This 
is a shrunken version of a canonical correlation analysis in which dummy 
variables appear on the left hand side.  Canonical variates are 
frequently overfitted and in some cases shrinkage is needed 
simultaneously on the left and right hand sides.  For example, one may 
have a multi-group discrimination problem where some of the groups have 
low frequencies and need to borrow information from the other groups. 
As another example, if one generated data from the linear model Y = X + 
residual and found optimum transformations of X and Y that maximized R^2 
  using canonical variates allowing for quadratic transformations, a b c 
d are solved for in the multivariate regression aY^2 + bY = cX^2 + dX. 
  Without penalization, the fitted model will be too nonlinear for small 
sample sizes.  Penalizing nonlinear terms would help.  Does anyone know 
of a method or code that does both-sides penalization for canonical 
variates (multivariate least-squares regression)?

Thanks
Frank
-- 
Frank E Harrell Jr   Professor and Chair           School of Medicine
                      Department of Biostatistics   Vanderbilt University



More information about the R-help mailing list