[R] Discrepancy in the regression coefficients for Cox regression - PBC data set
David Winsemius
dwinsemius at comcast.net
Fri Nov 21 18:34:08 CET 2008
There is a discussion in Appendix D.3 of "Modeling Survival Data" by
Thereau and Grambsch regarding the differences in the datasets
including the fact that "there was significantly more follow-up for
many patients at the time this dataset was assembled". I do not see a
material difference in the estimates.
--
David Winsemius, MD
Heritage Labs
On Nov 21, 2008, at 12:16 PM, Ravi Varadhan wrote:
> Hi,
>
> When I run the following Cox proportional hazards model on the Mayo
> clinic's
> PBC data set (given in the "survival" package), the regression
> coefficients
> do not agree with the results presented in Table 4.6.3 (p. 195) of
> Fleming &
> Harrington's book.
>
> library(survival)
>
> data(pbc)
>
> ans.cox <- coxph(Surv(time, status) ~ log(bili) + log(alb) + age +
> log(protime) + edema)
>
> ans.cox
>
>> ans.cox <- coxph(Surv(time, status) ~ log(bili) + log(alb) + age +
> log(protime) + edema)
>> ans.cox
> Call:
> coxph(formula = Surv(time, status) ~ log(bili) + log(alb) + age +
> log(protime) + edema)
>
>
> coef exp(coef) se(coef) z p
> log(bili) 0.8975 2.453 0.08271 10.85 0.0e+00
> log(alb) -2.4524 0.086 0.65707 -3.73 1.9e-04
> age 0.0382 1.039 0.00768 4.97 6.5e-07
> log(protime) 2.3458 10.442 0.77425 3.03 2.4e-03
> edema 0.6613 1.937 0.20595 3.21 1.3e-03
>
> Likelihood ratio test=234 on 5 df, p=0 n= 418
>>
>
> These coefficients, however, are significantly different (i.e. the
> differences can't be just attributed to round-off's) from that
> reported in
> Table 4.6.3 (in the "Final model" column) of Fleming and Harrington
> (p.
> 195). The coefficients reported are: 0.8707, -2.533, 0.0394, 2.380,
> 0.8592.
> Note the big difference for the "edema" variable.
>
> It seems like the data set considered in the book and that available
> in
> "survival" package are the same (with n=418).
>
> I also re-ran the Cox PH model with the 2 "data-errors" discussed in
> p.188
> of F&H, but still I could not match the results in Table 4.6.3.
>
> Is it possible that the results could be explained due to difference
> in
> convergence during maximization of partial likelihood?
>
> Can anyone help me figure out why this diescrepancy exists?
>
> Thanks very much,
> Ravi.
> ----------------------------------------------------------------------------
> -------
>
> Ravi Varadhan, Ph.D.
>
> Assistant Professor, The Center on Aging and Health
>
> Division of Geriatric Medicine and Gerontology
>
> Johns Hopkins University
>
> Ph: (410) 502-2619
>
> Fax: (410) 614-9625
>
> Email: rvaradhan at jhmi.edu
>
> Webpage: http://www.jhsph.edu/agingandhealth/People/Faculty/Varadhan.html
>
>
>
> ----------------------------------------------------------------------------
> --------
>
>
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list