[R] Factors and Multinomial Logistic Regression

peter dalgaard pdalgd at gmail.com
Thu May 2 22:04:26 CEST 2013


On May 2, 2013, at 20:33 , Lorenzo Isella wrote:

> On Wed, 01 May 2013 23:49:07 +0200, peter dalgaard <pdalgd at gmail.com> wrote:
> 
>> It still doesn't work!!!!!
>> 
> 
> 
> Apologies; since I had already imported nnet in my workspace, the script worked on my machine even without importing it explicitly (see the script at the end of the email).
> Sorry for the confusion.

You still owe us an answer why you thought that this:

Coefficients:
     (Intercept)     science       socst femalefemale
low     1.912288 -0.02356494 -0.03892428   0.81659717
high   -4.057284  0.02292179  0.04300323  -0.03287211

Std. Errors:
     (Intercept)    science      socst femalefemale
low     1.127255 0.02097468 0.01951649    0.3909804
high    1.222937 0.02087182 0.01988933    0.3500151

Residual Deviance: 388.0697 

is at all different from the Stata output. As far as I can tell it is EXACTLY the same!

Apologies for being insistent, but this will come up in Internet searches as "I couldn't make R do what Stata does".

> 
> I now mainly have a question about a definition: I can easily calculate the relative risk ratio (RRR) and its confidence interval (CI) for a given variable of my multinomial regression by exponentiating the variable and its original CI.
> However, how is the standard error on the RRR defined? This is now the only part of the stata calculation which I cannot reproduce.
> Cheers
> 

They would appear just to be delta-method based. 

s.e.(f(thetahat)) =~ f'(thetahat) s.e.(thetahat)

in casu f() is exp() and, e.g., looking at the coef. for female in the "low" table:

> .3909813 * exp(.8166202)
[1] 0.8847277

(It is a pretty useless quantity. Stata itself doesn't use it for much, either.)

> cc <- summary(mymodel)
> exp(cc$coefficients) * cc$standard.errors
     (Intercept)    science      socst femalefemale
low   7.62989469 0.02048619 0.01877141    0.8847053
high  0.02115184 0.02135577 0.02076329    0.3386964



> Lorenzo
> 
> ##############################################################################################
> 
> 
> 
> library(foreign)
> library(nnet)
> ## See the Stata example at http://bit.ly/11VG4ha
> 
> mydata <- read.dta("http://www.ats.ucla.edu/stat/data/hsb2.dta")
> 
> 
> sex <- rep(0, dim(mydata)[1])
> 
> sel <- which(mydata$female=="male")
> 
> sex[sel] <- 1
> 
> mydata$sex <- sex
> 
> ## IMPORTANT: redefine the base line!!!
> 
> mydata$ses2 <- relevel(mydata$ses, ref = "middle")
> 
> 
> ## NB: for some reason, if I use female (a factor assuming two values)
> ## I do not reproduce the results of the example.
> ## I need to use a variable which is numeric and assumes two values
> ## (that is why I introduced the variable sex))
> 
> ## mymodel <- multinom(ses2 ~ science+ socst+ sex, data=mydata)
> 
> 
> mymodel <- multinom(ses2 ~ science+ socst+ female, data=mydata)
> 
> 
> 
> 
> print(summary(mymodel))
> 
> print("The relative risk ratio (RRR) is, ")
> 
> print(exp(coef(mymodel)))

-- 
Peter Dalgaard, Professor,
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Email: pd.mes at cbs.dk  Priv: PDalgd at gmail.com



More information about the R-help mailing list