[R-sig-ME] zero-inflated-count-data?

Highland Statistics Ltd highstat at highstat.com
Mon Feb 26 18:26:02 CET 2018


----------------------------------------------------------------------

Message: 1
Date: Mon, 26 Feb 2018 14:57:00 +0100
From: "C. AMAL D. GLELE" <altessedac2 at gmail.com>
To: Jonathan Judge <bachlaw01 at outlook.com>
Cc: Ben Bolker <bbolker at gmail.com>,  R SIG Mixed Models
	<r-sig-mixed-models at r-project.org>
Subject: Re: [R-sig-ME] zero-inflated-count-data?
Message-ID:
	<CANrzCv0SZxAXjoftdkN7v5M4g6wrd3GM7qx23dFB=fi7JHisCg at mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi, dear all.
Many thanks to you all for your very helpful answers.
Jonathan,
I've started fitting a model using zeroinfl function from pscl package, but
I'm having the following

difficulty according to one of my regressors, let be H_var (categorical
with 8 levels):
as regressors, I have 7 categorical variables (with a total of 26 levels)
and two numerical

variables;
1) when I fit the model like follows,
model1<-zeroinfl(countdata~var1+H_var+var3+var4+var5+var6+var7+var8num

+var9num,dist="negbin",data=mydata)
, I receive the error message below:
"Error in solve.default(as.matrix(fit$hessian)) :
   system is computationally singular: reciprocal condition number =
7.05621e-21
In addition: Warning message:
glm.fit: fitted probabilities numerically 0 or 1 occurred
"
2)
but, if I remove H_var from the count component and fits model2 loke
follows,
model2<-zeroinfl(countdata~var1+var3+var4+var5+var6+var7+var8num+
var9num|H_var,dist="negbin",data=mydata)
  the model fits well and I do not receive error message anymore.
3)
If use H_var in both component of the model, like follows,
model3<-zeroinfl(countdata~var1+var3+var4+var5+var6+var7+var8num+
var9num+H_var|H_var,dist="negbin",data=mydata)
I receive the following error message:
"Error in solve.default(as.matrix(fit$hessian)) :
   system is computationally singular: reciprocal condition number =
4.2618e-20
"
Question:
  Does someone have any idea about probables causes of the problems posed
at points 1) and 3) ?






Without seeing the data......simplify your model? Collinearity? Start simple and build up the complexity of the model.
Maybe start with a Poisson GLM and figure out whether you really need a ZIP/ZINB? Why are you actually do a ZINB?






can you, please, provide me details (some ways to do it) and/or lead about
simulating data from a fitted model?





See step 10 in:

A protocol for conducting and presenting results of regression-type analyses (2016).
Zuur & Ieno.

http://onlinelibrary.wiley.com/doi/10.1111/2041-210X.12577/abstract

and see Figure 8 from that paper for an example. R code is somewhere online as well.


Alain







  In advance, thanks for your answers.
Best,

2018-02-25 23:55 GMT+01:00 Jonathan Judge <bachlaw01 at outlook.com>:
-- 

Dr. Alain F. Zuur
Highland Statistics Ltd.
9 St Clair Wynd
AB41 6DZ Newburgh, UK
Email: highstat at highstat.com
URL:   www.highstat.com

And:
NIOZ Royal Netherlands Institute for Sea Research,
Department of Coastal Systems, and Utrecht University,
P.O. Box 59, 1790 AB Den Burg,
Texel, The Netherlands



Author of:
1. Beginner's Guide to Spatial, Temporal and Spatial-Temporal Ecological Data Analysis with R-INLA. (2017).
2. Beginner's Guide to Zero-Inflated Models with R (2016).
3. Beginner's Guide to Data Exploration and Visualisation with R (2015).
4. Beginner's Guide to GAMM with R (2014).
5. Beginner's Guide to GLM and GLMM with R (2013).
6. Beginner's Guide to GAM with R (2012).
7. Zero Inflated Models and GLMM with R (2012).
8. A Beginner's Guide to R (2009).
9. Mixed effects models and extensions in ecology with R (2009).
10. Analysing Ecological Data (2007).



More information about the R-sig-mixed-models mailing list