[R] rpart with interval censored data crashes R

David Winsemius dwinsemius at comcast.net
Sat Jan 10 14:42:46 CET 2009


On a Leopard Mac with the Urbanek compiled 64 bit R, one sees this:

 > library(rpart)
 > library(survival)
Loading required package: splines
 > fit<-rpart(Surv(N,Y,type="interval2")~Salt+pH+Temp, data=myD)

  *** caught segfault ***
address 0x0, cause 'memory not mapped'

Traceback:
  1: .C(C_rpartexp2, as.integer(length(dtimes)),  
as.double(dtimes),     as.double(.Machine$double.eps), keep =  
integer(length(dtimes)))
  2: (get(paste("rpart", method, sep = ".")))(Y, offset, , wt)
  3: rpart(Surv(N, Y, type = "interval2") ~ Salt + pH + Temp, data =  
myD)

Possible actions:
1: abort (with core dump, if enabled)
2: normal R exit
3: exit R without saving workspace
4: exit R saving workspace

Choosing "4" does save the workspace.
-- 
David Winsemius

On Jan 9, 2009, at 9:04 AM, Keith Jewell wrote:

> Hi Everyone,
>
> This example code results in R 'crashing'; that is the R application  
> closes
> with no warnings or error messages.
> #-----------------------
> myD <- read.table(stdin(), header=TRUE, nrows=20)
> Broth Salt   pH Temp    N  Y Growth
> 1        310    9.0 2.92      10 90.0 NA            0
> 2        615    6.0 7.82      30  1.0  2            1
> 3        217    2.0 7.34      10  7.0  8            1
> 4        338   10.0 4.44      10 90.0 NA            0
> 5        240    4.0 7.33      10 20.0 21            1
> 6        336   10.0 3.90      10 90.0 NA            0
> 7        279    7.0 6.73      10 90.0 NA            0
> 8       1021    9.0 5.03      45  8.0  9            1
> 9        974    7.0 4.01      45 90.0 NA            0
> 10       265    7.0 2.93      10 90.0 NA            0
> 11       934    4.0 5.28      45  0.1  1            1
> 12       669    9.0 5.03      30 90.0 NA            0
> 13       875   10.0 6.24      37  1.0  2            1
> 14       385    2.0 5.84      20  1.0  2            1
> 15       562    2.0 5.84      30  0.1  1            1
> 16       718    0.5 5.54      37  0.1  1            1
> 17       845    9.0 5.03      37  3.0  6            1
> 18       913    2.0 5.84      45  0.1  1            1
> 19       577    4.0 4.10      30 90.0 NA            0
> 20        20    0.5 7.44       8 24.0 27            1
>
> library(rpart)
> library(survival)
> fit<-rpart(Surv(N,Y,type="interval2")~Salt+pH+Temp, data=myD)
> #---------------------
>
> Professor Ripley helpfully pointed out that the documentation does  
> not say
> that interval censoring is supported, and indeed this seems only to  
> happen
> with interval censored data.
>
> ?rpart indicates that the dependent variable may be a survival object.
> Neither ?rpart nor "An Introduction to Recursive Partitioning Using  
> the
> RPART Routines" (Therneau et al 1997) suggest that the dependent  
> variable
> may contain interval censored data, but neither do they suggest it
> shouldn't; i.e. as far as I'm aware (!) this restriction is not  
> documented.
>
> This post has three purposes:
>
> 1) Bring this behaviour - especially the crash in response to 'bad'  
> data -
> to the attention of the authors.
>
> 2) Seek an explanation of the restriction (if intentional). In my
> simplicity, it seems that interval censored data should be easier to  
> handle
> than left or right censored - after all the information content is  
> greater.
>
> 3) Seek guidance on how to work around the problem. I'm minded to  
> replace
> the interval censored data by the mid points of the intervals. Does  
> anyone
> have any comments on such an approach?
>
> Any comments gratefully received.
>
> Keith Jewell
> ==========================================
> Version:
> platform = i386-pc-mingw32
> arch = i386
> os = mingw32
> system = i386, mingw32
> status = Patched
> major = 2
> minor = 8.1
> year = 2009
> month = 01
> day = 07
> svn rev = 47502
> language = R
> version.string = R version 2.8.1 Patched (2009-01-07 r47502)
>
> Windows Server 2003 x64 (build 3790) Service Pack 2
>
> Locale:
> LC_COLLATE=English_United Kingdom.1252;LC_CTYPE=English_United
> Kingdom.1252;LC_MONETARY=English_United
> Kingdom.1252;LC_NUMERIC=C;LC_TIME=English_United Kingdom.1252
>
> Search Path:
> .GlobalEnv, package:stats, package:graphics, package:grDevices,
> package:utils, package:datasets, package:methods, Autoloads,  
> package:base
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.




More information about the R-help mailing list