[R] Fwd: Rpart help
kristen wissmar
wissmar.kristen at gmail.com
Wed May 24 03:45:50 CEST 2017
Hi R users!
I'm new to R, so I'm starting with a basic exercise in rpart.
I'm predicting whether a user will churn based on past order history. I
calculated the probabilities in Excel: if a user is a single-order
customer (single_order = 1), the probability of churn is 90%; if the user
has multiple orders (single_order = 0), the probability of churn is 70%.
In the R model, the probabilities look like 100% and 53% instead. In
Excel I used the count of shopper_key to calculate the probabilities, so
I'm wondering whether R needs a shopper_key to count?
It would be helpful if someone could suggest where I'm going wrong.
Thank you!
Code -
library(rpart)
m1 <- rpart(churn ~ single_order, data = data2, method = "anova")
Output -
n= 22041
node), split, n, deviance, yval
* denotes terminal node
1) root 22041 3229.265 0.8216959
2) single_order< 0.5 8407 2092.852 0.5325324 *
3) single_order>=0.5 13634 0.000 1.0000000 *
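As a consistency check on the output above, here is a minimal sketch that uses only the numbers printed by rpart. With method = "anova", yval is the mean of churn in each node, so for a 0/1 response it is the observed proportion of churners in that node. The variable names are made up for illustration.

```r
# Values copied from the rpart output above
n_multi  <- 8407;  y_multi  <- 0.5325324   # node 2: single_order <  0.5
n_single <- 13634; y_single <- 1.0000000   # node 3: single_order >= 0.5

# Weighted mean of the two leaf values; should reproduce the root yval
root_mean <- (n_multi * y_multi + n_single * y_single) / (n_multi + n_single)
print(root_mean)

# Number of churners implied by each leaf
churn_multi  <- round(n_multi * y_multi)
churn_single <- round(n_single * y_single)
print(c(multi = churn_multi, single = churn_single))
```

The weighted mean agrees with the root yval of 0.8216959, so the tree is reporting plain group proportions as they occur in data2. A leaf with yval 1 and deviance 0 means every single_order = 1 row in data2 has churn = 1.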
Sample data -
shopper_key  churn  single_order
          1      1             0
          2      1             1
          3      0             0
          4      1             0
          5      1             1
          6      1             1
          7      1             0
          8      1             1
          9      0             1
         10      1             1
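The Excel-style calculation (count of churners divided by count of shoppers in each group) can be reproduced in base R roughly as follows. This is only a sketch on the ten sample rows above, not on the full data2, and the object names sample_rows, rates and counts are made up for illustration. No shopper_key is needed for counting, because each row already represents one shopper.

```r
# The ten sample rows from the post (not the full 22041-row data2)
sample_rows <- data.frame(
  shopper_key  = 1:10,
  churn        = c(1, 1, 0, 1, 1, 1, 1, 1, 0, 1),
  single_order = c(0, 1, 0, 0, 1, 1, 0, 1, 1, 1)
)

# Churn rate per group: the mean of a 0/1 column is the proportion of 1s
rates <- tapply(sample_rows$churn, sample_rows$single_order, mean)
print(rates)

# Number of shoppers per group, for comparison with the n column of the tree
counts <- table(sample_rows$single_order)
print(counts)
```

Running the same tapply() call on data2 should give the yval figures from the tree, which makes it easier to see whether the R data and the Excel data actually contain the same rows.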