[R] how to extract the variables used in decision tree
David Winsemius
dwinsemius at comcast.net
Wed May 12 00:57:30 CEST 2010
On May 11, 2010, at 6:49 PM, Changbin Du wrote:
> HI, Dear R community,
>
> How to extract the variables actually used in tree construction? I
> want to
> extract these variables and combine other variable as my features in
> next
> step model building.
help(package=rpart, rpart.object)
>
>> printcp(fit.dimer)
>
Perhaps:
fit.dimer$terms
> Classification tree:
> rpart(formula = outcome ~ ., data = p_df, method = "class")
>
> Variables actually used in tree construction:
> [1] CT DP DY FC NE NW QT SK TA WC WD WG WW YG
>
> Root node error: 608/1743 = 0.34882
>
> n= 1743
>
> CP nsplit rel error xerror xstd
> 1 0.185033 0 1.00000 1.00000 0.032726
> 2 0.052632 2 0.62993 0.65132 0.028773
> 3 0.026316 4 0.52467 0.58059 0.027596
> 4 0.025768 5 0.49836 0.57072 0.027419
> 5 0.019737 8 0.42105 0.56086 0.027239
> 6 0.018092 9 0.40132 0.55263 0.027087
> 7 0.013158 12 0.34704 0.52796 0.026616
> 8 0.011513 13 0.33388 0.50000 0.026056
> 9 0.010000 15 0.31086 0.48520 0.025748
>
>
> THanks so much!
>
> --
> Sincerely,
> Changbin
> --
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
West Hartford, CT
More information about the R-help
mailing list