[R] Error using newdata argument in survfit

Hanke, Alex HankeA at mar.dfo-mpo.gc.ca
Thu Jun 16 14:23:34 CEST 2005


Hi Brian,
The factor Prior.f has 5 levels (1,2,3,4,5) which coxph deals with by
creating 4 dummy variables coded with 1 or zero. That's what I see when I
look at fit$x.
fit$x[1:5,]
   Week LagAOO factor(Prior.f)2 factor(Prior.f)3 factor(Prior.f)4
31   22      0                0                0                0
32   22      0                0                0                0
33   22      2                0                0                0
34   22      3                0                0                0
35   22      2                0                0                0
   factor(Prior.f)5
31                0
32                0
33                0
34                0
35                0 
I have played with the formula a bit adding a term at a time and then
checking to see if I can produce the survival curves for pseudo cohorts. I
get as far as the Prior.f term and am successful if I treat it as a
continuous variable. If I introduce it as a factor and assume it wants four
dummy variables as above I get the variable lengths error. If I represent
the term with one variable:
survfit(fit,list(Week=c(15,15),LagAOO=c(0,0),Prior.f=c(1,2)))
I get:
Error in x2 %*% coef : non-conformable arguments
Which is a nice change but still short of knowing what is going on.
 Regards
Alex 

-----Original Message-----
From: Prof Brian Ripley [mailto:ripley at stats.ox.ac.uk] 
Sent: June 15, 2005 4:24 PM
To: Hanke, Alex
Cc: 'r-help at stat.math.ethz.ch'
Subject: Re: [R] Error using newdata argument in survfit

You appear to have a coding for prior.f in newdata rather than the factor 
itself.

It's a bit hard to be sure when we don't have data8 to compare with.

On Wed, 15 Jun 2005, Hanke, Alex wrote:

> Dear R-helpers,
> To get curves for a pseudo cohort other than the one centered at the mean
of
> the covariates, I have been trying to use the newdata argument to survfit
> with no success. Here is my model statement, the newdata and the ensuing
> error. What am I doing wrong?
>
>> summary(fit)
> Call:
> coxph(formula = Surv(Start, Stop, Event, type = "counting") ~
>    Week + LagAOO + Prior.f + cluster(interaction(Station, Year)),
>    data = data8, method = "breslow", x = T, y = T)
>
>  n= 1878
>            coef exp(coef) se(coef) robust se     z       p
> Week     0.00582      1.01   0.0323    0.0239 0.244 8.1e-01
> LagAOO   0.71929      2.05   0.1238    0.1215 5.918 3.3e-09
> Prior.f2 0.12927      1.14   0.4402    0.4025 0.321 7.5e-01
> Prior.f3 0.79082      2.21   0.5484    0.4460 1.773 7.6e-02
> Prior.f4 2.04189      7.71   0.6008    0.4685 4.358 1.3e-05
> Prior.f5 1.20450      3.34   0.6423    0.5481 2.198 2.8e-02
>
>         exp(coef) exp(-coef) lower .95 upper .95
> Week          1.01      0.994     0.960      1.05
> LagAOO        2.05      0.487     1.618      2.61
> Prior.f2      1.14      0.879     0.517      2.50
> Prior.f3      2.21      0.453     0.920      5.29
> Prior.f4      7.71      0.130     3.076     19.30
> Prior.f5      3.34      0.300     1.139      9.76
>
> Rsquare= 0.047   (max possible= 0.25 )
> Likelihood ratio test= 91  on 6 df,   p=0
> Wald test            = 209  on 6 df,   p=0
> Score (logrank) test = 142  on 6 df,   p=0,   Robust = 17.4  p=0.00803
>
>  (Note: the likelihood ratio and score tests assume independence of
>     observations within a cluster, the Wald and robust score tests do
not).
>>
> newdat
>      Week   LagAOO Prior.f2 Prior.f3 Prior.f4 Prior.f5
> 1 17.55218 1.191693        1        0        0        0
> 2 17.55218 1.191693        0        0        0        0
>
>> survfit(fit,newdata=newdat)
> Error in model.frame(formula, rownames, variables, varnames, extras,
> extranames,  :
>        variable lengths differ
> In addition: Warning message:
> 'newdata' had 2 rows but variable(s) found have 1878 rows
>
> Regards,
> Alex
>
>
> Alex Hanke
> Department of Fisheries and Oceans
> St. Andrews Biological Station
> 531 Brandy Cove Road
> St. Andrews, NB
> Canada
> E5B 2L9
>
>
>
> 	[[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide!
http://www.R-project.org/posting-guide.html
>

-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595




More information about the R-help mailing list