[R] Declaring All Variables as Factors in GLM()
Leonardo Ferreira Fontenelle
leonardof at leonardof.med.br
Sat Apr 30 13:47:57 CEST 2016
This should do the trick:
history2 <- as.data.frame(lapply(history, as.factor))
Mind you that read.csv() by default reads string vectors as factors, so
that declaring the variables as factors should only be necessary for the
numeric ones, like income. Using as.factor() in factor variables may
drop unused levels, but in your case I believe it won't be a problem.
HTH,
Leonardo Ferreira Fontenelle
http://lattes.cnpq.br/9234772336296638
Em Sáb 30 abr. 2016, às 04:25, Preetam Pal escreveu:
> Hi guys,
>
> I am running glm(y~., data = history,family=binomial)-essentially,
> logistic
> regression for credit scoring (y = 0 or 1). The dataset 'history' has 14
> variables, a few examples:
> history <- read.csv("history.csv". header = TRUE)
> 1> 'income = 100,200,300 (these are numbers in my dataset; however
> interpretation is that these are just tags or labels,for every
> observation,
> its income gets assigned one of these tags)
> 2> 'job' = 'private','government','unemployed','student'
>
> I want to declare all the regressors and y variables *as factors*
> programmatically. Would be great if anyone can help me with this (idea is
> to loop over variable names and use as.factor - but not sure how to do
> this). Thanks
>
> Regards,
> Preetam
> --
> Preetam Pal
> (+91)-9432212774
> M-Stat 2nd Year, Room No.
> N-114
> Statistics Division, C.V.Raman
> Hall
> Indian Statistical Institute, B.H.O.S.
> Kolkata.
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help em r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list