Beruflich Dokumente
Kultur Dokumente
a framework of 7 steps
for prediction models
Ewout Steyerberg
Professor of Medical Decision Making
Dept of Public Health, Erasmus MC, Rotterdam, the Netherlands
Overview
Background: Oberwolfach in the mountains
A framework to develop prediction models
Potential usefulness
Discussion: how to improve prediction research
Oberwolfach
Go cross-country skiing
Sell book
Presentation options
Theoretical challenges
Practical challenges
2. Methodological problems
Missing values
Optimal recoding and dichotomization
Stepwise selection, relatively small data sets
Presentation
Validation
Potential solutions
Awareness and education
Scientific progress required
Translation to practice
Epidemiologists/clinicians interested in prediction modeling
Statisticians not interested in prediction modeling
Reporting guidelines
Not yet available
Study protocol registration
Possible, rare
http://www.clinicalpredictionmodels.org
http://www.springer.com/978-0-387-77243-1
Prognostic
modelling checklist:
intended to assist in
developing a
valid prediction
model
Usefulness of framework
Checklist for model building
SMART data, survival after cardiovascular event, 2008
Critical assessment of model building
GUSTO-I model, Lee 1995
Predictors
2. Coding of predictors
continuous predictors
linear and restricted cubic spline functions
truncation of values (for example for systolic blood pressure)
categorical variables
Detailed categorization for location of infarction:
anterior (39%), inferior (58%), or other (3%)
Ordinality ignored for Killip class (I IV)
class III and class IV each contained only 1% of the patients
3. Model specification
Main effects: .. which variables were most strongly related to shortterm mortality:
hypothesis testing rather than prediction question
Interactions: many tested, one included: Age*Killip
Linearity of predictors:
transformations chosen at univariate analysis were also used in
multivariable analysis
4. Model estimation
Standard ML
No shrinkage / penalization
No external information
5. Model performance
Discrimination
AUC
Calibration: observed vs predicted
Graphically, including deciles
(links to Hosmer-Lemeshow goodness of fit test)
Specific subgroups of patients
Calibration
Calibration
6. Model validation
10-fold cross validation
100 bootstrap samples
model refitted, tested on the original sample
7. Model presentation
Predictor effects:
Relative importance: Chi-square statistics
Relative effects: Odds ratios graphically
Predictions
Formula
A
p
l
i
c
a
t
i
o
n
a
r
e
a
C
a
l
i
b
r
a
t
i
o
n
D
i
s
c
r
i
m
i
n
a
t
i
o
n
C
l
i
n
i
c
a
l
u
s
e
f
u
l
n
e
s
P
uT
baP
h
e
a
l
t
h
r
g
e
t
i
n
g
o
f
p
r
e
v
e
n
t
i
v
e
i
n
t
e
r
v
e
n
t
i
o
n
s
d
c
t
i
n
c
i
d
n
t
d
s
a
s
e
x
X
x
C
lD
inT
iacegasntlopsrrtdiacertw
e
o
r
k
u
p
i
n
g
X
x
X
tT
S
a
r
i
n
g
t
e
a
t
m
e
n
t
hS
eInutregnpisecuatylicdoefdcetircseiaositnonm
m
a
k
i
n
g
a
k
i
n
g
X
x
X
e
n
t
D
l
a
y
i
n
g
t
r
e
a
m
e
n
t
X
x
X
R
eC
sInoecavluracsrihaotneiandajuR
C
T
X
x
X
s
t
m
e
n
t
i
n
a
R
C
T
X
n
f
o
u
n
d
r
a
d
j
u
s
t
m
e
t
w
i
t
h
a
p
r
o
p
e
n
s
i
t
y
s
c
o
r
e
C
ase-m
ixadjustm
ent
Which performance measure when?