Sie sind auf Seite 1von 8

FormulaeforcalculatingIDIandNRI

WorkedexampleofIDIandNRIuse
This example uses data from an article published in the BMJ,
1
investigating if a new risk
factor, the metabolic syndrome, adds to prediction of total mortality above and beyond
established risk factors (smoking, diabetes, hypertension, total cholesterol) that are already
used in the primary preventive clinical setting. The question was researched in 2322 50-year-
old men of the Uppsala Longitudinal Study of Adult Men cohort. Two logistic regression
models with total mortality as outcome (1078 died) were investigated and compared; one
small model with only the established risk factors as exposures, and one larger model with
established risk factors plus a metabolic syndrome variable as exposures. Please see the
original article for a detailed description.
1
The Ethics Committee of Uppsala University
Hospital approved the study, and written informed consent was obtained from all
participants.
The variables used in this example are abbreviated such: mortality = mortality, smo =
smoking, diab = diabetes, htn = hypertension, chol = cholesterol, mets = the metabolic
syndrome (the new risk factor; a single variable in this example, but the software could
equally well handle multiple new risk factors simultaneously).
No established thresholds for total mortality risk over 30 years for middle-aged men exist,
so for this example we constructed three risk groups: low (<33%), intermediate (33-67%), and
high (>67%) risk, reflecting the fact that the average remaining lifespan of a 50 year old
Swedish man in 1970
2
was similar to the follow-up in the sample.
The SAS software needs the following input in this example: %nriidi(ds = name of the data
set, y = mortality, id = identification variable, model1n = chol, model1c = smok diab htn, model2n =
chol, model2c = smok diab htn mets, nriskcat = 3, riskcutoffs = 33 67, printtable = Y)
The Stata software needs the following input for calculating IDI: idi mortality smo diab htn
chol, prvars(mets); and the following input for calculating NRI: nri2 mortality smo diab htn chol,
2

prvars(mets) cut(33 67). The nri2 version of the command was chosen because we used two
risk cutpoints in this example.
The R software needs the following input : Model formula for logistic regression (in the
format mortality ~ smo + diab + htn + chol + metswith the new predictor last), ds (name of the
dataset), lim (vector with limits for risk strata), and npred (number of new predictors in the
model).
The resulting established measures and NRI and IDI are found in Tables 1 and 2. In the
example, C-statistics between the two models are not statistically significantly different, but
the NRI and IDI are. The clinical relevance of that phenomenon is unclear, and it is also an
area for further research how the magnitudes of NRI and IDI are to be interpreted.
Technically, the IDI is similar to the difference in Pearsons R
2
measures between two
models, or the difference in scaled Brier scores.
3
IDIs from different clinical fields may be
differently valued. Notably, the NRI reclassification table is important per se.





3

Table1.TestsofDiscriminatoryValue,GoodnessofFit,andCalibration
Estimate P-value
Odds ratio of total mortality corresponding
to having the metabolic syndrome (vs. not),
adjusted for established risk factors
1.55 (95% CI 1.22 to 1.96) <0.001

Results from a small and a larger logistic regression model
C-statistic 0.648 (95% CI 0.626 to 0.671)
HL chi
2
10.98 0.20
Model with established risk
factors (small model)
LR chi
2
160.85 <0.001

C-statistic 0.653 (95% CI 0.630 to 0.675)
HL chi
2
7.13 0.52
Model with established risk
factors and the metabolic
syndrome variable (larger
model)
LR chi
2
173.98 <0.001

Comparing the small and the larger models
Difference between C-statistics 0.19
LR test 13.12 0.0003
IDI 0.0053 (95% CI 0.0023 to 0.0083) 0.0006
NRI 0.0328 (95% CI 0.0075 to 0.0582) 0.011

4

Hosmer-Lemeshow (HL) tests based on 10 groups. CI, confidence interval; LR, likelihood
ratio; IDI, integrated discrimination improvement; NRI, net reclassification improvement
(risk classification tables corresponding to this NRI are presented in Table 2).


5

Table2.RiskClassificationTables
Classification tables of predicted risks from the small and the larger model
Non-Cases Predicted risks from model with established risk
factors and the metabolic syndrome variable
(larger model)
Low Intermediate High Total
Low 353 26 0 379
Intermediate 16 797 31 844
High 0 8 13 21
Predicted risks from
model with established
risk factors (small model)
Total 369 831 44 1244

Cases Predicted risks from model with established risk
factors and the metabolic syndrome variable
(larger model)
Low Intermediate High Total
Low 150 20 0 170
Intermediate 6 741 79 826
High 0 29 53 82
Predicted risks from
model with established
risk factors (small model)
Total 156 790 132 1078

Data in the tables are numbers of persons in classes of predicted risk from a small model and
a larger model (same models as in Table 1), separately for non-cases and cases. If the larger
model on average assigns a higher risk class to cases and a lower risk class to non-cases than
6

the small model (as in this example), NRI is positive (NRI for comparison of these two
models given in Table 1).
7

8


References
(1) Sundstrom J, Riserus U, Byberg L, Zethelius B, Lithell H, Lind L. Clinical value of the
metabolic syndrome for long term prediction of total and cardiovascular mortality:
prospective, population based cohort study. BMJ 2006 April 15;332(7546):878-82.
(2) SCB, Statistics Sweden. 9-12-2005.

(3) Steyerberg EW, Vickers AJ, Cook NR et al. Assessing the performance of prediction
models: a framework for traditional and novel measures. Epidemiology 2010
January;21(1):128-38.

Das könnte Ihnen auch gefallen