Beruflich Dokumente
Kultur Dokumente
with Nextel
The Context (contd)
Churn
Churner yi = + 1
Non-churner yi = - 1
< 26 55
<3 3
55< & 26
Yes
No
Yes No
Handset price
$150 <$150
No Yes
Bagging and Boosting
f1* x
e.g. tree
f2* x
Aggregating bootstrap samples
f1* x
f2* x
Churn propensity score:
B
1 * x
f3* x
f bag ( x )
B
b
f
b 1
. . . Churn classification:
fB* x
cbag ( x ) sign fbag ( x )
Bagging
Let the calibration sample be Z={(x1,y1), , (xi,yi), , (xN ,yN)}
B bootstrap samples Z b* , b 1, 2, , B
cbag ( x) sign fbag ( x)
Stochastic Gradient Boosting
Winner of the Teradata Churn Modeling Tournament
(Cardell, Golovnya and Steinberg, Salford Systems).
Behavioral predictors
e.g. the average monthly minutes of use
Proportional
Company interactions yi = - 1
Real-life N=100,462 variables
proportion Sample
e.g. mean
of churners = 1.8% unrounded minutes of customer
care calls yi = + 1
Customer demographics Xi=(x1,, x46) yi
e.g. the number of adults in the household
Time
Research Questions
Do bagging (and boosting) provide better results than
other benchmarks?
What are the financial gains to be expected from this improvement?
What are the more relevant churn drivers or triggers that marketers
Donkers et al. 2003; Franses and Paap 2001, p.73-75; Imbens and Lancaster
cbag ( x) sign fbag ( x) B
Assessing the Best Bias Correction
Bias Correction
Risk to churn
10%
Top - decile lift
With 10% = the proportion of churners in this risky segment
And = the proportion of churners in the whole validation set
Financial Gains: Neslin et al. (2004)
Bagging
Stochastic Gradient Boosting
Binary Logit Model
2.4
2.2
Top decile*
+26%
2.0
1.8
1.6
0 20 40 60 80 100
Number of iterations
* Model estimated on the balanced sample, and lift computed on the validation sample.
Validated** Top-Decile Lift
No / Intercept Weighting
Model*
correction correction
Binary logit model 1.775 1.764
Gain = + $ 8,550,000
Gain = + $ 3,214,800
Most Important Churn Triggers
Mean inbound calls less 1 min. Bagging
Mean monthly min. wireless to wireless
Age
Mean monthly min. of use
Mean attempted calls
Months in service
Mean completed calls
Average monthly min. of use (6 months)
Mean overage revenue
Total revenue over life
Mean peak calls
Handset price
Base cost of the calling plan
Change in monthly min. of use
Equipment days
0
20
40
60
80
100
Relative importance
Partial Dependence Plots
Bagging
62
56
60
54
58
52
Probability to churn
Probability to churn
56
50
54
48
52
46
50
44
48
51
50
49
Conclusions: Main Findings
1. Bagging and S.G. boosting are substantially better
classifiers than the binary logit choice model
Improvement of 26% for the top-decile lift,
Profit 1 N 1 LVC 1 1 1 c
LVC of a churner Incentive cost for the Contact
who does not churners retained cost
churn + non-churners
targeted
Top decile 1 1 /
Gain 1- 2 Profit 1 Profit 2
N 1 - 2 LVC
N Top decile LVC