[R] results of a survival analysis change when converting the data to counting process format
Ferenci Tamas
t@m@@@|erenc| @end|ng |rom med@t@t@hu
Sun Aug 18 19:10:12 CEST 2019
Dear All,
Consider the following simple example:
library( survival )
data( veteran )
coef( coxph(Surv(time, status) ~ trt + prior + karno, data = veteran) )
trt prior karno
0.180197194 -0.005550919 -0.033771018
Note that we have neither time-dependent covariates, nor time-varying
coefficients, so the results should be the same if we change to
counting process format, no matter where we cut the times.
That's true if we cut at event times:
veteran2 <- survSplit( Surv(time, status) ~ trt + prior + karno,
data = veteran, cut = unique( veteran$time ) )
coef( coxph(Surv(tstart,time, status) ~ trt + prior + karno, data = veteran2 ) )
trt prior karno
0.180197194 -0.005550919 -0.033771018
But quite interestingly not true, if we cut at every day:
veteran3 <- survSplit( Surv(time, status) ~ trt + prior + karno,
data = veteran, cut = 1:max(veteran$time) )
coef( coxph(Surv(tstart,time, status) ~ trt + prior + karno, data = veteran3 ) )
trt prior karno
0.180197215 -0.005550913 -0.033771016
The difference is not large, but definitely more than just a rounding
error, or something like that.
What's going on? How can the results get wrong, especially by
including more cutpoints?
Thank you in advance,
Tamas
More information about the R-help
mailing list