[R] [R-pkgs] glmpath (new version 0.91)
Trevor Hastie
hastie at stanford.edu
Thu Mar 2 02:27:23 CET 2006
We have uploaded to CRAN a new version of glmpath, a package
which fits the L1 regularization path for generalized linear models.
The revision includes:
- coxpath, a function for fitting the L1-regularization path for the Cox
ph model;
- bootstrap functions for analyzing sparse solutions;
- the ability to mix in L2 regularization along with L1 (elasticnet).
We have also completed a report that describes these methods in detail:
http://www-stat.stanford.edu/~hastie/Papers/glmpath.pdf
The lars package in R fits the entire piecewise-linear L1 regularization
path for
the lasso. The coefficient paths for L1 regularized glms, however,
are not piecewise linear.
glmpath uses convex optimization - in particular predictor-corrector
methods -
to fit the coefficient path at important junctions. These junctions
are at the "knots" in |beta|
where variables enter/leave the active set; i.e. nonzero/zero values.
Users can request greater resolution at a cost of more computation,
and compute values
on a fine grid between the knots.
The code is fast, and can handle largish problems efficiently.
it took just over 4 sec system cpu time to fit the logistic
regression path for
the "spam" data from UCI with 3065 training obs and 57 predictors.
For a microarray example with 5000 variables and 100 observations, 11
seconds cpu time.
Currently glmpath implements binomial, poisson and gaussian families.
The additional coxpath algorithm
extends the concepts to survival data via the L1-regularized Cox
proportional hazards model.
We have added bootstrap functions, which help make sense of
automatically chosen models (via AIC). Shake up the data,
and a different model might be chosen. The bootstrap is a natural way of
measuring the strength (significance) of variables,
and more.
Mixing in a small amount of L2 regularization brings a lot of stability
to the lasso; for example, without it the limiting
sequence for a logistic regression model with separable data is
undefined. We have added this option to
glmpath and coxpath
Mee Young Park and Trevor Hastie
Update details:
### glmpath 0.90 -> glmpath 0.91 ###
<Major changes>
1. Elastic net penalty added
2. Coxpath functions: coxpath, cv.coxpath, plot.coxpath, predict.coxpath
2. Bootstrap functions: bootstrap.path, plot.bootpath
<Minor changes>
glmpath:
actions[[1]] label
subtract 1 from every id number in actions
bshoot.threshold = 0.1 instead of 0.5
relax.lambda = 1e-8 instead of 1e-9
re-center the intercept in b.corrector/b.predictor
fix gaussian AIC/BIC
corrector1:
fix the first element of AIC/BIC
[[alternative HTML version deleted]]
_______________________________________________
R-packages mailing list
R-packages at stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/r-packages
More information about the R-help
mailing list