[R] stack overflow and predict()

Liaw, Andy andy_liaw at merck.com
Sat Nov 8 01:24:25 CET 2003


Try something like this (suppose x is the matrix of predictors in the
training set, and xtest is the same for the test set):

my.rp <- rpart(y ~ x, ...)
test.pred <- predict(my.rp, newdata=data.frame(x=I(xtest)))

Make sure the name of the variable in the data frame given to newdata
matches the name of the variable in the original formula, in this case `x',
a matrix.

HTH,
Andy


> From: Ji Zhu [mailto:jizhu at umich.edu] 
> 
> Dear R users,
> 
> I'm trying to use rpart() to build a classification tree on a 
> big dataset. The number of samples is n=100 and the number of 
> variables is p=10000.
> 
> At first I stored all the data in a data.frame and got a 
> "stack overflow" error; then I changed the data into a matrix 
> and the problem disappeared. Now the trouble is when I try to 
> use the predict() function, since each newdata is a long list 
> with p=10000 elements, the predict() function doesn't 
> recognize it and simply returns the fitted values at the 
> training data (rather than the newdata).
> 
> Could anyone give me some suggestion on how to proceed?  Thank you.
> 
> Regards,
> 
> Ji
> 
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Ji Zhu                          439 West Hall
> Assistant Professor             550 East University
> Department of Statistics        Ann Arbor, MI 48109
> University of Michigan          (734) 936-2577 (O)
> jizhu at umich.edu                 (734) 763-4676 (F)
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~




More information about the R-help mailing list