[R-sig-ME] Structural zeros in lme4

Tue Oct 5 16:01:37 CEST 2010

On Tue, Oct 5, 2010 at 1:06 AM, Lydia Belton <lbelton at zoology.up.ac.za> wrote:
> Dear R users,
> I am working with a binary data set of categorical variables with repeated
> measurements on 100 individuals over 24 trials with follwing variables

> success =0 or 1
> age = j, s, y or a
> ra = l, m, h
> sex= m or f

> With the data layed out as below
>
> success   age    ra    sex   ind trial
> 1         a      l     m     1     1
> 1         s      m     f     2     1
> 0         y      h     f     3     1
> ....................................
> ....................................
> ....................................
> 0         a      l     m     1     24
> 0         s      m     f     2     24
> 1         y      h     f     3     24
> When I try to run the following model lme4

> success~(age + ra + sex)^2 + (1|ind), binomial, verbose=T)
> I get the following error message

> 0:           nan: 0.411061 -2.99573 -0.587787  14.0386  14.3205 -12.5703
> -11.9282  11.3544  12.2767  1.07485  1.33312  13.4198      nan      nan
> -11.0981 -14.4590 -13.2578  1.18634      nan
> Error in asMethod(object) : matrix is not symmetric [1,2]
> In addition: Warning message:
> In mer_finalize(ans) : gr cannot be computed at initial par (65)

> >From reading previous posts I'm assuming this is caused by empty cells
> that are produced by the interactions in the data set.

> However these empty cells are inherent in the data. Whilst all
> combinations of sex and age are possible not all combinations or ra and
> age, or ra and sex, can exist.

> Specifically ra = l and sex = f
>             ra = l and age = s
>             ra = l and age = j cannot occur.

> If the interactions with ra are removed the model runs. However
> interactions between ra, age and sex are theoretically possible. From
> looking at the raw data and running a glm without individual as a random
> effect it seems like there might be a significant interaction between ra
> and age.

> Does anyone know if there is a way to test this without collapsing my data?

I don't know of an easy way of doing that but, as you are the second
person in as many weeks to pose such a question, it is obviously a
situation that we should accommodate.

The glm function uses a modified version of some classical linear
algebra software (Linpack, which predated Lapack) specifically to
handle rank deficient cases.  These arise most commonly from missing
cells, as above.  The way the calculations are done in lmer and glmer
we don't have the option of using that decomposition that detects rank
deficiency.  We may be able to modify the code to do a preliminary
decomposition using the modified Linpack routine and take corrective
action if it is found to be rank deficient.  I regret to say that I
won't be able to do so soon as I am swamped by my teaching obligations
at present.