Beruflich Dokumente
Kultur Dokumente
Hein Putter
Outline
Landmarking
Background
Dynamic prediction
Why dynamic prediction?
Illustration
Multi-state approach
Dynamic prediction and landmarking
Basic idea
Landmark (super) models
Landmarking in action
Landmarking and competing risks
Competing risks
Dynamic pseudo-observations
Discussion
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion
Background
Landmarking
Origin of landmarking
I Origin: debate on the effect of response to chemotherapy
on survival (Anderson JR, Cain KC, Gelber RD, 1983, J
Clin Oncol 1, 710-719)
I Common way of analysis: make two groups, a "responder"
group and a "non-responder" group and compare survival
between these two groups
I Problem with this approach: a potential responder will only
belong to the "responder" group if he/she survives until
time of response
I Individuals in the responder group are immortal for some
time, this gives them an unfair survival advantage:
immortal time bias
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion
Background
Time-dependent covariates
Background
Correct approaches
Dynamic prediction
Illustration
Illustration
Illustration
Covariates
Covariate N (%)
Gender
Male 1564 (39%)
Female 2468 (61%)
Education
Less than high school 1736 (43%)
High school 1212 (30%)
Some college 1084 (27%)
BMI
≤ 25 2244 (56%)
25 – 30 1388 (34%)
> 30 390 (10%)
Smoking
Never 1997 (50%)
Past 1683 (42%)
Current 324 (8%)
Multi-state approach
Multi-state approach
I Multi-state model; process X (t) in time, taking values
I 0: alive without ADL disability
I 1: alive with ADL disability
I 2: dead
0. Healthy 1. Disabled
2. Death
Multi-state approach
Multi-state approach
I Multi-state model; process X (t) in time, taking values
I 0: alive without ADL disability
I 1: alive with ADL disability
I 2: dead
0. Healthy 1. Disabled
2. Death
Multi-state approach
Healthy−>Disabled
Healthy−>Dead
Disabled−>Healthy
Disabled−>Dead
5
4
Cumulative hazard
3
2
1
0
75 80 85 90 95 100 105
Age
Multi-state approach
Otherwise
I If the Markov assumption does not hold, we would have to
use (micro-)simulation
I In our data, there is evidence that history of ADL-disability
increases disability rate (⇒ violation of Markov assumption)
Basic idea
Basic idea
Basic idea
Basic idea
Basic idea
Robustness
Combining information
β(s) = β0 + β1 s
How to implement it
Baseline hazards
I Baseline hazards for different landmark time points s may
be combined
I To add more structure and to make it easier to interpret the
models
I We may assume a model
λ0 (t | s) = λ0 (t) exp(θ(s))
with θ(s0 ) = 0 for identifiability
I In our application we take
θ(s) = θ1 s + θ2 s2
I Model can be fitted directly by applying a simple Cox
model to the stacked data set
I Landmark time s not used as stratifying variable but as
covariate
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion
Landmarking in action
Set-up
I Endpoint is survival in a window of fixed width w = 10
years from the moment of prediction
I Landmark time points used: 16 points, equally spaced,
from age 75 to age 90
I For each landmark (prediction) time point, construct
landmark data set, containing all relevant information
needed for the prediction
I In all data sets we take all patients still at risk (alive),
compute the current value of ADL-disability, and set the
horizon at thor = tLM + 10 years
I At each landmark point we fit a simple Cox model on
(tLM , thor ) and use that to obtain a prediction of survival at
thor + 10
Landmarking in action
2000
ADL disabled
Healthy
1500
1000
500
0
75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
Landmarking in action
Regression coefficients
Regression coefficients with 95% confidence intervals
75 80 85 90
SMOpast SMOcurrent
1.5
1.0
0.5
0.0
−0.5
−1.0
BMI325−30 BMI3>30 ADLADL disabled
Regression coefficient
1.5
1.0
0.5
0.0
−0.5
−1.0
educ3HighSchool educ3SomeCollege sexfemale
1.5
1.0
0.5
0.0
−0.5
−1.0
75 80 85 90 75 80 85 90
Landmark age
Landmarking in action
Landmarking in action
1.2
2.5
1.0
0.8
Cumulative hazard
2.0
exp(theta(s))
0.6
0.4
1.5
0.2
1.0
0.0
75 80 85 90 95 75 80 85 90
Landmarking in action
Regression coefficients
Blue lines are the (landmark-varying) supermodel effect
estimates 75 80 85 90
SMOpast SMOcurrent
1.5
1.0
0.5
0.0
−0.5
−1.0
Regression coefficient
Landmark age
Landmarking in action
0.8
0.6
Probability of surviving next 10 years
0.4
0.2
0.8
0.6
0.4
0.2
75 80 85 90 75 80 85 90
Age at prediction
Landmarking in action
Software
dynpred
I It is not so difficult to write your own code in the statistical
package of your choice
I In R, package dynpred is available on CRAN
(cran.r-project.org)
I The companion package of the book "Dynamic Prediction
in Clinical Survival Analysis" by Hans van Houwelingen and
myself (Chapman & Hall)
I Functions available to create landmark data sets, applying
administrative censoring at horizon (cutLM ), and to
calculate dynamic "death within window" curves (Fwindow)
I On the book website
http://www.msbi.nl/DynamicPrediction, R code
(using the dynpred package) of all the analyses in the
book is available for download
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion
Competing risks
Data (EBMT)
I 5582 CML patients with transplantation (SCT) between
1997 and 2003
I Two competing risks: "relapse" (Rel) and "non-relapse
mortality" (NRM)
Cumulative incidences
1.0
Cumulative incidences
0.8 RFS
0.6
NRM
0.4
0.2
Relapse
0.0
0 2 4 6 8 10 12
Competing risks
Events of interest
I Objective:
I To give prognosis of the disease/recovery process after
SCT for a patient with a given post-transplant history
I Prediction to be based on covariates
I Baseline: year of SCT, risk score (low, medium, high)
I Time-dependent: Acute Graft-versus-Host-Disease
(aGvHD)
aGvHD
I Low or high grade
1.0
Cumulative incidences
0.8
0.6
0.4
High grade
0.2
Low grade
0.0
0 1 2 3 4
Competing risks
Time-dependent covariates
I Define
Objective
I For a patient, alive without relapse at time s after SCT, with
given covariates Z ∗ (s), what is the probability that he/she
will
I have had relapse before time s + w
I have died without relapse before time s + w
I In our application: w is five years
Competing risks
I J types of failure
I T̃ time of failure, C censoring time, T = min(T̃ , C)
I D: type of failure, ∆ = D if failure occurred, 0 otherwise
I Z (t) vector of covariates (possibly time-dependent)
I We observe (Ti , ∆i , Zi (·)) for individual i
I Assume censoring is independent of (T̃ , D) given
covariates
I Cumulative incidences
Competing risks
Competing risks
Dynamic pseudo-observations
Yi = 1{Ti ≤ s + w, Di = j}
Dynamic pseudo-observations
(delta-method)
Dynamic pseudo-observations
Dynamic pseudo-observations
I Recall
Yi = 1{Ti ≤ s + w, Di = j}
I Unfortunately, not available for censored patients !
I Define the dynamic pseudo-observation
(−i)
θ̂is = ns F̂j (s + w|s) − (ns − 1)F̂j (s + w|s),
where
X dij
F̂j (s + w|s) = · Ŝ(ti − |s),
ni
s<ti ≤s+w
with
I dij = no of cause j events at ti ; ni = no at risk at ti
Dynamic pseudo-observations
θ̂is = 1{Ti ≤ s + w, Di = j}
Dynamic pseudo-observations
θ̂is = 1{Ti ≤ s + w, Di = j}
With censoring
I (P1) θ̂is is asymptotically independent of θ̂ls for individuals
i 6= l as ns → ∞
I (P2) θ̂is is asymptotically independent of θ̂ls0 for individuals
0
i 6= l and landmark time points s 6= s as ns , ns0 → ∞
I (P3) E θ̂is |Ti >
s, Zi equals asymptotically its theoretical
counterpart E 1{Ti ≤ s + w, Di = j}|Ti > s, , Zi as ns → ∞
(Follow more or less directly from results in Graw et al. (2009).)
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion
Dynamic pseudo-observations
Censored data
Fixed s
I Yi = 1{Ti ≤ s + w, Di = j}, i ∈ Ds not observed
I Retain score equations and replace yi by
pseudo-observations θ̂is
I The quasi-score equation is
ns
X ∂µi (β) 1
· · [θ̂is − µi ] = 0
∂β µi (1 − µi )
i=1
Dynamic pseudo-observations
(delta-method)
Dynamic pseudo-observations
Dynamic pseudo-observations
Super models
I Define µik = µi (sk ) = P(Ti ≤ sk + w, Di = j | Ti > sk )
I GLM
g(µik |Zi (sk )) = β0 (sk ) + β > (sk )Zi (sk ),
I Choose smooth model for l th component of β(s)
Dynamic pseudo-observations
with β(s)
b = H(s)β̂
I Its variance is estimated consistently by
dg −1 (x) 2
· (Z∗ )> · H(s) · var(
c β) b · H(s)> · Z∗
dx b > Z∗
|x=(β)
Dynamic pseudo-observations
Scatter-plot
Relapse
0.0 0.5 1.0 0.0 0.5 1.0
1.2
0 months
0.6
0.984 0.961 0.947 0.913
0.0
●
●
●
●
●
●
●
●●●
●
3 months
●
●
●●
●
●
●
●
●●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●●
●
●●
●
●
●
1.0
●
●●
●
●
●
●
●
●●
●
●●
●●
●●
●
●
●
●
●
●●
●
●●
●●●
●
●●
●
●●
●
●
●●
●●
●
●
●
●●
●
●●
●●
●
●
●●
●
●
●●
●
●●
●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●●
●
●●
●●
●●
●●
●●
●●
●●
●
●●
●
●
●●
●●
●
●
0.0
●
●●●
●
●●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●●
●
●
●
●●
●
●
●●
●●
●●
●
●
●
●●
●●
●
●
●●
●●
●
●●
●
●●
●●
●
●
●●
●
●●
●●
●
●●
●●
●
●●
●
●●
●
●●
●●
●
●
● ●
●
●
● ● ●
●
●
● ●
●
●●●
●
● ●
●
●● ●●
6 months
●
●
●● ●●
●
●
●
●●
●
●● ●
●
●●
●
●
●●
●● ●●
●
●
●
●● ●
●●
●●
●
●
● ●
●●
●
●
●
●
●
●●
● ●
●
●
●●
●
●
●●
● ●
●●
●
●
●●
●
●● ●
●
●
●●
●● ●●
1.0
●
●
●●
● ●
●●
●
●
●●
●
●
● ●
●
●●
●
●
●
●
●●
● ●
●●
●
●
●
●
●●
●
●
●
●● ●●
●
●●
●
●
●
●
●●
●
●●
●
● ●●
●
●
●●
●
●
●
●●
●
●●
●●
●
●
●●
● ●●
●
●●
●
●●
●
●●
●
●●
● ●●
●
0.992 0.970
0.5
●
●
●
●● ●
●●
●
●
●
●●
●
● ●●
●
●●
●
●●
●
●●
● ●●
●
●
●●
●
●●
●
●
●●
●●
●
●● ●
●●
●●
●●
●
●
●●
●
●●●
●
●●
●
●●
●●
● ●●●
●
●
●
●●
●●
●
●
●
●
●
●●
●
●● ●
●●
●
●●
●
●●
●
●
●●
● ●●
●
0.0
●●
●
●
●●
● ●●
●
●●
●
●
●
●●
●
●●
●
●
●● ●
●●
●
●●
●●
●●
●
●●
●●
●
●
● ●
●
●●●
●
●
●
●●
●
●●
●●
● ●●
●●
●●
●●
●
●●
●
●●●
●●
● ●●
●
●●
●●
●
●
●
●●
●
●
●●
●●
● ●●
●
●●
●
●●
●
●
●
●●
●
●●
●● ●
●●
●
●●
●
●●
●●
●
●
●
●
●
●●
●
●●
● ●
●
●●
●
●●
●
●
● ●
● ●
●
●
● ●
● ●
●
●●
●
● ● ●
● ●
●
●
● ●
●
●●● ●
●
●●●
9 months
●
●
●● ●
●
●● ●
●
●●
●
●
●● ●●
●
● ●
●
●●
●
●●
●
● ●
●
●●
● ●●
●
●
●●
●● ●●
●
● ●
●
●●
●
●● ●
●● ●●
●
●●
●
●
● ●
●●
●
● ●
●
●●
●
●
●
●
●●
● ●
●
●
●● ●●
●
●●
●
●
●●
● ●
●●
●
● ●
●●
●
●
●
●● ●●
● ●
●●
1.0
●●
●
●●
●● ●
●
●●
●
●● ●
●
●●
●
●●
●
●●
●
● ●
●
●
●● ●●
●
●
●
●●
●
●
●● ●
●●
●
●
● ●
●
●●
●
●
●
●●
●
●●
● ●
●
●●
●
●● ●
●
●
●●●
●●
●
●
●
●● ●
●●
●
●
●
●● ●
●●
●
●
●
●●
●
●●
●
●
●●
●●
●
●● ●●
●
●●
●
●
●● ●●
●
●●
●
●
●●
● ● ●
0.980
0.5
●●
●
●●
● ●
●
●●
●
● ●
●
●
●
●
●
●
●●
●
●
●● ●●
●
●
●
●● ●
●●
●
●
●●
●
●
●●
●●
●●
●●
● ●
●●
●
●●
●●
● ●●
●
●●
●●
●
●
●●
●●
●●
●
●●
●
●●
●● ●●
●●
●●
●●
●
●●
●● ●
●
●
●●●
●●
●●
●
●
●
●
●
●●
● ● ●
●●
●
●
●●
● ●
●●
●
●
●●
●●
●
●●
● ●●
●
●●
● ●
●
●●
●
0.0
●●
●
●
●●
● ●●
●
●●
●
● ●●
●
●●
●
●
●
●●
●
●●
●
●
●● ●
●●
●
●●
●●
●● ●
●●
●●
●
●
●●
●
●●
●●
●
●
● ●
●
●●●
●
● ●●●
●
●
●
●
●
●●
●
●●
●●
● ●
●●
●●
●
●●
● ●
●
●
●●
●
●●
●
●●
●
●
●●●
●●
●● ●
●●
●●
●
●●
●●
● ●
●●
●●
●●
●
●
●
●
●●
●
●●
●●
● ●●
●
●●
●● ●
●●
●
●●
●
●●
●
●●
●●
●
●●
●● ●●
●●
●
●
●
●
●●
●●
● ●
●●
●●
●
●●
●
●
●
●
●●
●
●
●●
● ●
●
●●
●
●
●●
● ●●
●●
●
●●
●●
●
1.5
● ● ● ●
●
● ●
● ●
● ●
●
● ● ● ●
●
● ●
● ●
● ● ●
● ● ● ● ●●
●
12 months
●
● ●
●
● ●
●
●● ●
●
●●
● ●
● ●
● ●
●
●
● ●
●
●●● ●
●
●●● ●
●
●●●
●
●
●● ●
●
●● ●
●
●● ●●
●
●
●
●
●
●● ●
●●
●
● ●●
●
●● ●●
●●
●
●●
●
● ●
●
●●
● ●●
●
● ●●
●
●●
●● ●●
●
● ●
●
●● ●
●
●●
● ●
●● ●●
●● ●
●●
● ●
●●
●
●
●
●●
● ●
●
●●
● ●
●
●
●● ●●
●
●●
●
●●
●
●
● ●●
●
●
● ●
●●
●
● ●●
●
●
● ● ● ●●
1.0
●
●●
●
● ●●
●
●● ●●
●
●● ●●
●
●
●●
●●
●●
● ●
●
●
●●
●
● ●
●
●
●●
●
● ●●
●
●●
●
●
●
●●
●
● ●●
●
●
●● ●●
●
●
●● ●●
●
●●
●●
●
●
●● ●
●●
●
● ●
●●
●
● ●
●●
●
●
●
●●
●
● ●
●
●●
● ●
●
●●
● ●
●
●●
●
●●
●
●●
●●
● ●
●●
●●
●
●
● ●
●●
●●
●
●
● ●
●●
●●
●
●
●
●
●●
●
● ●
●●
● ●
●●
● ●
●●
●
0.5
●
●
●●
●
● ●
●●
●
●
● ●●
●
●
● ●
●●
●
●
●
●●
●
●●
●
●●
●●
●
●●
● ●●●
●●
●
●●
●●
●
●● ●
●
●●
●●
●
●●
●
●
● ●●●
●
●●
●●
●
●
●
●
●
●
●●●
●●
● ●
●
●●
●●
●
●
●● ●
●●●
●
●
●
●● ●
●●●
●
●
●
●●
●●
●●
● ●
●●
●●
●● ●
●
●●
●
● ●
●●
●
●
0.0
●
●
●●
●
●●
● ●●
●
●
●●
●● ●●
●●
●
●
● ●●
●
●●
●
●
●
●●
●
●●
●
●●
●
●● ●●
●
●●
●
●●
●●
●
● ●
●
●●
●
●
●●
●●
●
● ●
●
●
●●
●●
●
●
●
●
●
●●
● ●●
●
●
●● ●
●
●
●●
● ●
●
●●
●
●
●●
●
●●
●
●
●●
●
●●
● ●●●●
●
●
●●
●●
●
●● ●
●●●
●
●
●●
●
●● ●●●
●
●
●●
●
●●
●
●●
●
●
●●
● ●
●●
●● ●
●●
●●
●
●
●●
●● ●●
●●
●
●●
●
●
●●
● ●●
●●
●
●●
●
●
●●
●
●●
●●
●
●●
●
●● ●
●
●
●
●●
● ●
●● ●●
●
●●
●
●●
● ●
●●
●●
●●
●
●
●
●
●
●●
● ●
●
●
●
●
●●
●
● ●
●●
●
●
●
●
●
●●
●
● ●●●
●
●
●
●●
●
● ●●
● ●
●●
● ●
●●
●
0.0 0.6 1.2 0.0 0.5 1.0 0.0 0.5 1.0 1.5
Dynamic pseudo-observations
Dynamic pseudo-observations
1
Low risk Low risk
Medium risk Medium risk
High risk High risk
Regression coefficients
● ● ●
● ●
●
● ● ●
●
● ●
●
● ●
● ●
● ● ● ●
●
−1
● ● ● ● ● ●
● ● ● ● ●
● ● ● ● ●
●
● ● ●
● ● ● ●
●
● ●
●
● ●
● ●
●
● ● ●
−2
● ● ●
● ●
● ●
●
● ●
● ● ●
● ●
●
●
●
●
●
−3
Dynamic pseudo-observations
3
Separate models Separate models
Supermodel Supermodel
2
● ● ● ●
●
Regression coefficients
●
●
● ● ●
●
●
1
0
−1
● ● ● ●
● ● ●
● ●
● ● ●
−2
−3
Dynamic pseudo-observations
1.0
No aGvHD No aGvHD No aGvHD
Low grade Low grade Low grade
High grade High grade High grade
0.8
Predicted probabilities at s + 5 years
Relapse
Cif1.pred.s2no
Cif1.pred.s3no
0.6
● ●
● ●
● ●
0.4
● ●
● ●
● ●
● ● ● ● ●
● ● ● ●
● ● ● ●
● ● ● ● ●
● ● ● ● ● ●
● ● ● ● ● ●
● ● ● ● ●
● ● ● ●
0.2
● ● ● ● ● ●
● ● ● ● ● ● ● ● ● ● ●
● ● ● ● ● ● ● ●
● ● ● ● ● ● ● ●
● ● ●
● ● ● ● ● ● ●
● ● ● ● ● ● ● ● ● ● ● ●
● ● ● ● ●
0.0
1.0
●
●
●
Cif2.pred.s2no
Cif2.pred.s3no
0.6
● ●
●
● ●
NRM
● ● ●
● ● ●
0.4
● ●
● ● ● ●
● ●
● ● ● ●
●
● ● ● ● ● ●
● ● ● ● ●
● ● ●
● ● ● ●
● ● ● ● ● ● ●
0.2
● ● ● ● ● ● ●
● ● ● ● ● ● ●
● ● ● ● ● ● ● ● ● ●
● ● ● ● ● ● ● ●
● ● ● ● ●
● ● ● ● ● ● ●
● ● ● ● ●
● ● ● ● ●
● ● ● ●
0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Discussion
Discussion (general)
Advantages of standard approach
I It is standard
I There is software
I You gain biological understanding (hopefully) by modeling
the effects of covariates on transitions
Advantages of landmarking
I It is more direct; no need for complicated formulas for
prediction
I Predictions obtained from multi-state model may be off the
mark if assumptions are violated or if model fit is not good
I Sparse model (considerably fewer parameters than
multi-state model)
References
Cortese, G. and Andersen, P. K. (2009).
Competing risks and time-dependent covariates.
Biom J 32: 138–158.
Graw, F. and Gerds, T. A. and Schumacher, M. (2009).
On pseudo-values for regression analysis in multi-state models.
Lifetime Data Anal 15: 241–255.
van Houwelingen, H. C. (2007).
Dynamic prediction by landmarking in event history analysis.
Scand J Stat 34: 70–85.
Kurland, B. F. and Heagerty, P. J. (2005).
Directly parameterized regression conditioning on being alive: analysis of
longitudinal data truncated by deaths.
Biostat 6: 241–258.
Nicolaie, M. A. and van Houwelingen, H. C. and de Witte, T. M. and Putter,
H. (2012).
Dynamic prediction in competing risks by landmarking.
Stat Med, in press.
References
H. C. van Houwelingen (2007).
Dynamic prediction by landmarking in event history analysis.
Scand. J. Stat. 34: 70–85.
H. C. van Houwelingen and H. Putter (2008).
Dynamic predicting by landmarking as an alternative for multi-state modeling: an
application to acute lymphois leukemia data.
Lifetime Data Anal. 14: 447–463.
H. C. van Houwelingen and H. Putter (2012).
Dynamic Predicting in Clinical Survival Analysis.
Chapman & Hall.
D. Y. Lin and L. J. Wei (1989).
The robust inference for the Cox proportional hazards model.
JASA 84: 1074–1078.
R. Xu and J. O’Quigley (2000).
Estimating average regression effects under non-proportional hazards.
Biostatistics 1: 423–439.
Y. Y. Zheng and P. J. Heagerty (2005).
Partly conditional survival models for longitudinal data.
Biometrics 61: 379–391.
Dynamic prediction Hein Putter
Landmarking Dynamic prediction Dynamic prediction and landmarking Landmarking and competing risks Discussion