[R] Stepwise
Ben Bolker
bolker at ufl.edu
Thu Sep 4 19:40:53 CEST 2008
Peter Flom <peterf <at> brainscope.com> writes:
>
> Robin Williams wrote
> <<<<
> Is there any facility in R to perform a stepwise process on a model,
> which will remove any highly-correlated explanatory variables? I am told
> there is in SPSS. I have a large number of variables (some correlated),
> which I would like to just chuck in to a model and perform stepwise and
> see what comes out the other end, to give me an idea perhaps as to which
> variables I should focus on.
> Thanks for any help / suggestions.
> >>>
>
> Stepwise is a bad method of selecting variables. Far better methods are LASSO
and LAR (least angle
> regression), available in the LARS package and the LASSO2 package.
>
> However, while both these methods are good, neither is a substitute for
substantive knowledge.
>
> Also, the key thing is not so much whether variables are correlated, but
whether they are co-linear, which
> is different. If you have a great many variables, then you can have a high
degree of colinearity even with no
> high pairwise correlations. I've not done this in R, but
>
> RSiteSearch("collinearity", restrict = 'functions') yields 34 hits.
>
> HTH
>
> Peter
>
Another suggestion would be to do PCA on the predictor variables.
And to read Frank Harrell's book on _Regression modeling strategies_.
cheers
Ben Bolker
More information about the R-help
mailing list