[R-sig-ME] Little variability in outcome; "pwrssUpdate did not converge"

Wed Mar 25 01:06:41 CET 2015

Thanks for your response! I'd prefer to model this the same way I did in three other populations (with lower means and sample sizes) for the sake of presentation and comparability. The basic idea (sorry that wasn't clear) is a sibling control design, examining the effect of paternal age within families (i.e. no marginal models for me).

I'm not sure I understand how I could estimate offsets separately from the conditional analysis. I've tried including only families with at least two sibs (nope), but wouldn't selecting based on the outcome introduce bias? How would I remedy that?

My previous mail contained a mis-specified model, since that happened to give any output and I thought it might be informative. 
It also had a odd prior specification. The default specification is c(10,2.5). Unthinkingly, I set a very high SD on the slopes i.e. c(9,9). That's not a good idea since these high SDs on the normal put a lot of weight on 0 and 1 on the logit (there's a section on this in 2.6. of the MCMCglmm course notes).
Unfortunately, even though I do get improved results with small subsamples (30k) using the default prior spec (as opposed to vanilla glmer), the models still do not converge with the 3.5m dataset. 

I was thinking that I might get closer by simply splitting my sample? I'm of course still hoping there's some control I've missed.

> On 25 Mar 2015, at 00:01, David Duffy <David.Duffy at qimr.edu.au> wrote:
> 
> On Mon, 23 Mar 2015, Ruben Arslan wrote:
> 
>> I have a dichotomous outcome (child mortality) with a very high mean
>> (0.9946) in a large dataset (3.5m).
>> 
>> I thought maybe the problem still is complete separation and I'm just being
>> too timid with the blme prior.
>> 
>> Oddly (maybe not), the only model where I do get convergence is one where I
>> accidentally mis-specified my sample, so my outcome was censored (hence the
>> mean but not the intercept was lower). I'm attaching the model.
> 
> The misspecified model? Maybe you should be doing something else, such as bivariate logistic (dropping extra offspring) or marginal models? If you are interested just in familial aggregation, you can do the conditional analysis using just the ~18000 odd families with one or more events, using the other families just to estimate offsets.
> 
> A few random thoughts ;)
> 
> | David Duffy (MBBS PhD)
> | email: David.Duffy at qimrberghofer.edu.au  ph: INT+61+7+3362-0217 fax: -0101
> | Genetic Epidemiology, QIMR Berghofer Institute of Medical Research
> | 300 Herston Rd, Brisbane, Queensland 4006, Australia  GPG 4D0B994A