[R] Apparently Conflicting Results with coxph

Kevin E. Thorpe kevin.thorpe at utoronto.ca
Mon Oct 1 15:57:27 CEST 2007


Peter Dalgaard wrote:
> Kevin E. Thorpe wrote:
>> Dear List:
>>
>> I have a data frame prepared in the couting process style for including
>> a binary time-dependent covariate.  The first few rows look like this.
>>
>>     PtNo Start    End Status Imp
>> 1      1     0  608.0      0   0
>> 2      2     0  513.0      0   0
>> 3      2   513  887.0      0   1
>> 4      3     0   57.0      0   0
>> 5      3    57  604.0      0   1
>> 6      4     0  150.0      1   0
>>
>>
>> The outcome is mortality and the covariate is for an implantable
>> defibrillator, so it is expected that the implant would reduce the
>> risk of death.  The results of fitting coxph (survival package) are:
>>
>> Call:
>> coxph(formula = Surv(Start, End, Status) ~ Imp, data = nina.excl)
>>
>>
>>      coef exp(coef) se(coef)     z    p
>> Imp 0.163      1.18    0.485 0.337 0.74
>>
>> Likelihood ratio test=0.11  on 1 df, p=0.738  n= 335
>>
>> Since this was unexpected, I created a non-counting process data
>> frame with an indicator variable representing received an implant
>> or not.  Here are the results:
>>
>> Call:
>> coxph(formula = Surv(Days, Dead) ~ Implant, data = nina.excl0)
>>
>>
>>          coef exp(coef) se(coef)     z       p
>> Implant -1.77     0.171    0.426 -4.15 3.3e-05
>>
>> Likelihood ratio test=19.1  on 1 df, p=1.21e-05  n= 197
>>
>> I found this degree of discrepancy surprising, especially the point
>> estimate of the coefficient.  I have verified the data frames are
>> set up correctly.
>>
>> Here is what I have tried to understand what is going on.
>>
>> I tried fitting models adjusted for other covariates that I have in
>> the data frame.  This did not appreciably affect the coefficients
>> for the implant variable.
>>
>> I ran cox.zph on the two models shown above and plotted the results.
>> In both cases, the point estimate of Beta(t) is sort of parabolic
>> in that the curves are monotonically increasing to a local maximum
>> after which they are monotonically decreasing (the CIs are a bit
>> more wiggly).
>>
>> I would interpret this to mean that the effect of implant is probably
>> time-dependent.  If so, how do I actually get a "proper" estimate of
>> beta(t) for a variable like this?
>>
>> Are there some other things I should look at to understand what's
>> going on?
>>
>>   
> If you want to play with time-dependent regression coefficients have a
> look at the timereg package and the book that it supports.
> 
> However, first you need to consider the possibility of selection effects
> that can take place even with non-varying effects. In the case at hand I
> would suspect a bias created by the fact that you don't implant devices
> into people who are already dead.
> 

Thanks.  The point in your last paragraph did cross my mind too.

-- 
Kevin E. Thorpe
Biostatistician/Trialist, Knowledge Translation Program
Assistant Professor, Department of Public Health Sciences
Faculty of Medicine, University of Toronto
email: kevin.thorpe at utoronto.ca  Tel: 416.864.5776  Fax: 416.864.6057



More information about the R-help mailing list