[R] R - rpart - increasing xerror

Uwe Ligges ligges at statistik.tu-dortmund.de
Sun Aug 19 17:16:34 CEST 2012



On 18.08.2012 21:32, Daniel Blankenheim wrote:
> Hey,
> my name is Daniel. I am writing my bachelor thesis and wondering if you can
> help me.
> I am trying to generate a regression tree via rpart. To
> reduce the error of the model I use cross-validation, but instead
> of decreasing, the cross-validation error (xerror) increases the more splits
> there are.

... which indicates overfitting. If y is independent of your x(s), the
best prediction is the overall mean, and that happens with 0 splits.
Anyway, this is something to ask your supervisor, since this list is
about R-help rather than help on statistical modelling.
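
To see the effect for yourself, here is a minimal sketch on simulated
data. It assumes the rpart package is installed; the variable names,
the seed and the control settings are made up for illustration and are
not taken from your analysis. The response is generated independently
of the predictors, so any split the tree makes is fitting noise: the
"rel error" column keeps falling, while xerror will usually stay near
or above 1.

```r
library(rpart)

set.seed(1)  # arbitrary seed, only for reproducibility
n <- 200
d <- data.frame(x1 = rnorm(n), x2 = rnorm(n), x3 = rnorm(n))
d$y <- rnorm(n)  # y is generated independently of x1, x2, x3

# Grow a deliberately large tree (small cp) with 10-fold cross-validation
fit <- rpart(y ~ x1 + x2 + x3, data = d, method = "anova",
             control = rpart.control(cp = 0.001, xval = 10))
printcp(fit)  # compare the "rel error" and "xerror" columns

# Prune back to the subtree with the smallest cross-validated error
cp_tab  <- fit$cptable
best_cp <- cp_tab[which.min(cp_tab[, "xerror"]), "CP"]
pruned  <- prune(fit, cp = best_cp)
```

If the smallest xerror sits in the row with nsplit = 0, pruning returns
the root node only, i.e. the overall mean as the prediction.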

Best,
Uwe Ligges

> I don't understand what that means. Does it mean that my model doesn't fit the
> data, that there is no trend in the data? I am 100% sure that
> there must be a trend or correlation in the data.
>
>>> please help me:)
>
>>>          CP nsplit rel error xerror    xstd
>>> 1 0.100022      0   1.00000 1.0192 0.14222
>>> 2 0.066716      2   0.79996 1.3107 0.18720
>>> 3 0.050471      3   0.73324 1.4127 0.21138
>>> 4 0.033758      4   0.68277 1.5197 0.22826
>>> 5 0.010376      5   0.64901 1.5360 0.23792
>>> 6 0.000010      6   0.63864 1.5419 0.24280
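
For reading a table like the one above: rel error and xerror are both
scaled so that the root node (no splits) has error 1, so an xerror above
1 means the tree predicts worse under cross-validation than the overall
mean does. A small sketch that applies the usual selection rules to the
posted numbers (typed in by hand; the object and column names are
chosen here for illustration only):

```r
# Values copied from the posted cp table
cp_table <- data.frame(
  CP        = c(0.100022, 0.066716, 0.050471, 0.033758, 0.010376, 0.000010),
  nsplit    = c(0, 2, 3, 4, 5, 6),
  rel_error = c(1.00000, 0.79996, 0.73324, 0.68277, 0.64901, 0.63864),
  xerror    = c(1.0192, 1.3107, 1.4127, 1.5197, 1.5360, 1.5419),
  xstd      = c(0.14222, 0.18720, 0.21138, 0.22826, 0.23792, 0.24280)
)

# Rule 1: take the row with the smallest cross-validated error
best <- which.min(cp_table$xerror)

# Rule 2 (one-standard-error rule): the smallest tree whose xerror is
# within one xstd of that minimum
threshold <- cp_table$xerror[best] + cp_table$xstd[best]
one_se    <- min(which(cp_table$xerror <= threshold))

cp_table[c(best, one_se), c("nsplit", "xerror", "xstd")]
```

Both rules select the first row here (nsplit = 0), which is the tree
with no splits at all.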
>