[R] Why the order of parameters in a logistic regression affects results significantly?

Qinghua He qinghua.he at yahoo.com
Fri Jul 22 00:04:22 CEST 2016


Using the same data, if I ran
fit2 <-glm(formula=AR~Age+LumA+LumB+HER2+Basal+Normal,family=binomial,data=RacComp1)summary(fit2)exp(coef(fit2)) 
I obtained:
> exp(coef(fit2))(Intercept)         Age        LumA        LumB        HER2       Basal      Normal  0.24866935  1.00433781  0.10639937  0.31614001  0.08220685 20.25180956          NA 
while if I ran

fit2 <-glm(formula=AR~Age+LumA+LumB+Basal+Normal+HER2,family=binomial,data=RacComp1)summary(fit2)exp(coef(fit2))
I obtained:
> exp(coef(fit2)) (Intercept)          Age         LumA         LumB        Basal       Normal         HER2   0.02044232   1.00433781   1.29428846   3.84566516 246.35185956  12.16443690           NA 

Essentially they're the same model - I just moved HER2 to the last. But the OR changed significantly. Can someone explain?
For the latter result, I don't even know how to interpret as all factors have OR>1 (except Intercept), how could that possible? Can I eliminate the effect of intercept?
Also, I cannot obtain OR for the last factor due to collinearity. However, I know others obtained OR for all factors for the same dataset. Can someone tell me how to obtain OR for all factors? All factors are categorical variables (i.e., 0 or 1).
Thanks!
Peter
	[[alternative HTML version deleted]]



More information about the R-help mailing list