[R] Warning message as a result of logistic regression performed

Gavin Simpson gavin.simpson at ucl.ac.uk
Wed May 27 16:24:35 CEST 2009


Try reading this thread:

http://thread.gmane.org/gmane.comp.lang.r.general/134368/focus=134475

especially the posts by I Kosmidis which show you how to diagnose
problems in logit model fits like this.

There is a statement about this warning in ?glm as well and a pointer to
a reference which discusses a source of the warning.

G

On Wed, 2009-05-27 at 11:22 +0100, Winter, Katherine wrote:
> I am sorry if this question sounds basic but I am having trouble understanding a warning message I have been receiving in R after attempting logistic regression.
> 
> I have been using the logistic regression function in R to analyse a simulated data set. The dependent variable "failure" has an outcome of either 0 (success) or 1 (failure). Both the independent variables have been previously generated in a mathematical model and stored in a data.frame for analysis. I am currently using a sample size of 1000 and I use the following commands in R:
> 
>        log.reg.1 <- glm(failure ~ age +weight +init.para.log.value +k.d1,family=binomial(logit), data=test)
> 	log.reg.1.summary <- summary(log.reg.1); print(log.reg.1.summary)
> 	log.reg.1.exp <- exp(log.reg.1$coef); print(log.reg.1.exp)
> 
> When I execute these commands I get the following warning message: 
> 
> "In glm.fit(x = X, y = Y, weights = weights, start = start, etastart = etastart,  :fitted probabilities numerically 0 or 1 occurred"
> 
> I am unsure what this warning is referring to. I have tried using google to answer this question but have had no luck. 
> 
> I have been on the following website https://stat.ethz.ch/pipermail/r-sig-ecology/2008-July/000278.html but found it was not helpful as I when I ran the example given I received no warning message (I am using R version 2.8.1). 
> 
> I am working with simulated data so there are no missing values in the data set.
> 
> I have also looked at the following website http://tolstoy.newcastle.edu.au/R/help/05/07/7759.html they suggest that the warning is as a result of "perfect separation" of the results (a possibility with simulated data). However, when I added an extra row to my data.frame of results that I knew to be false and hence to prevent "perfect separation" subsequent logistic regression still resulted in the same warning message. 
> 
> I am still at a loss as to the meaning of this message and any help in understanding this warning would be much appreciated.
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
-- 
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%
 Dr. Gavin Simpson             [t] +44 (0)20 7679 0522
 ECRC, UCL Geography,          [f] +44 (0)20 7679 0565
 Pearson Building,             [e] gavin.simpsonATNOSPAMucl.ac.uk
 Gower Street, London          [w] http://www.ucl.ac.uk/~ucfagls/
 UK. WC1E 6BT.                 [w] http://www.freshwaters.org.uk
%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%~%

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <https://stat.ethz.ch/pipermail/r-help/attachments/20090527/25eba3a6/attachment-0002.bin>


More information about the R-help mailing list