[R] Coding of categorical variables for logistic regression?

David Winsemius dwinsemius at comcast.net
Sun Mar 28 19:30:29 CEST 2010

On Mar 28, 2010, at 1:15 PM, Ravi Kulkarni wrote:

> Hello,
>  I am trying to do a logistic regression and have one predictor  
> variable
> (x) that is ratio and two predictor variables (y and z) that are
> categorical. These have three levels each which I have called "High",
> "Medium" and "Low".
>  My question: do I need to use a numerical coding scheme for the
> categorical variables as required by some statistical software  
> packages,
> with some sort of numeric dummy-variable coding?

No. If you have constructed those variables as factors, the regression  
functions in R will interpret them correctly, i.e. as though the  
dummies were in there. If you have not constructed them as factors,  
you should do so now.


>  I am using:
>                 glm(binvar~x+y+z, family=binomial(link="logit"))
>  Thanks,
>    Ravi Kulkarni
> -- 

David Winsemius, MD
West Hartford, CT

More information about the R-help mailing list