[R] variable selection in logistic
Ben Bolker
bolker at ufl.edu
Thu Sep 3 04:43:56 CEST 2009
David Winsemius wrote:
>
>
> On Sep 2, 2009, at 9:36 PM, annie Zhang wrote:
>
>> Hi, R users,
>>
>> What may be the best function in R to do variable selection in
>> logistic
>> regression?
>
> PhD theses, and books by famous statisticians have been pursuing the
> answer to that question for decades.
>
>> I have the same number of variables as the number of samples,
>> and I want to select the best variablesfor prediction. Is there any
>> function
>> doing forward selection followed by backward elimination in stepwise
>> logistic regression?
>
> You should probably be reading up on penalized regression methods. The
> stepwise procedures reporting unadjusted "significance" made available
> by SAS and SPSS to the unwary neophyte user have very poor statistical
> properties.
>
> --
>
> David Winsemius, MD
> Heritage Laboratories
> West Hartford, CT
>
>
I would start with Frank Harrell's book: loads of practical, but rigorous,
advice.
@book{harrell_regression_2001,
title = {Regression Modeling Strategies},
isbn = {0387952322},
publisher = {Springer},
author = {Harrell, Frank},
year = {2001}
}
"As many variables as samples" is particularly scary.
--
View this message in context: http://www.nabble.com/variable-selection-in-logistic-tp25268519p25268984.html
Sent from the R help mailing list archive at Nabble.com.
More information about the R-help
mailing list