Beruflich Dokumente
Kultur Dokumente
When Averages Hide Individual Differences in Clinical Trials: Analyzing the results of clinical
trials to expose individual patient's risks might help doctors make better treatment decisions
Author(s): David Kent Rodney Hayward
Source: American Scientist, Vol. 95, No. 1 (JANUARY-FEBRUARY 2007), pp. 60-68
Published by: Sigma Xi, The Scientific Research Society
Stable URL: http://www.jstor.org/stable/27858901
Accessed: 08-09-2015 11:04 UTC
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/
info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content
in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship.
For more information about JSTOR, please contact support@jstor.org.
Sigma Xi, The Scientific Research Society is collaborating with JSTOR to digitize, preserve and extend access to American
Scientist.
http://www.jstor.org
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
When
Hide
Averages
in Clinical
Differences
Individual
Trials
medicine at theUniversity of
He was the 2005
Michigan.
retary
forHealthAwardfor
career
accomplishments in
health-services research. Ad
60
American
Scientist, Volume
95
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
in Suffolk, England,
took 2,000 digital photos of people
living in the small town of Haverhill
Figure 1. In the year 2000 artist Chris Dorley-Brown
one image. This illustration shows 12
blended
into
all
had
been
until
these photographs,
and then used software tomerge
2,000
step by step,
has left out the original
a blend made up of all of the double portraits. (Dorley-Brown
double portraits (two photographs morphed
together) and
are
of thousands of individuals
to
treatment
the
In
the
modern
clinical
in
to
the
of
the
order
trial,
responses
protect
participants.)
privacy
portraits
are averaged,
in the same way the center photograph
in a single number
represents all the other individuals. As the data
typically summarized
to guide medical
practice.
important individual differences are lost. The authors propose ways to better examine clinical-trials results
assessment
group or the control group?and
of the outcome measure is typically blinded or
masked (the assessing physician does not know
whether patients received treatment).Random
ization is the feature that gives trials the power
to find the treatment's effect in the clutter of
differentpatient riskprofiles. Ifpatients are ran
domly assigned to the experimental and control
groups, risk factors should be equally distrib
uted between the groups. Thus any difference
in the aggregated outcomes of the two groups
can be attributed to the effectsof treatment.
The treatment-effect,as it is called, is typical
ly a single number that summarizes the overall
www.americanscientist.org
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
January-February
61
trialwith conventional
trialwith riskprofile-based
subgroup analysis
subgroup analysis
???4A
Si
A? &4
4?|
4*
4
benefitedfromtreatment
4?*
harmed
by treatment
4?
Figure 2. The patients in a large clinical trial inevitably have different health histories and risk factors. Many medical
investigators have assumed
more like the real
that this kind of heterogeneity
is an advantage
because
itmakes
the trial population
But summarizing
treatment
population.
results for a population
in a single number may gloss over subgroups
of patients whose
is quite different from the statistical average.
response
Conventional
"one variable at a time" subgroup analysis is unlikely to find subgroups with large differences
in response to therapy. Newer
types
of analysis, using risk profiles, may be more powerful
in grouping patients according
to their likelihood
of benefiting
from treatment.
American
Scientist, Volume
95
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
ofdying(albeitonlyslightly).
Benefits
Summarizing trial results may exaggerate the
benefit of treatment for some patients, but the
reverse is also possible: A negative overall result
can hide significant benefit to some patients.
Consider, forexample, the Atlantis ? trial,un
dertaken in the late 1990s. This trial tested the
efficacy of t-PA in treating strokes instead of
heart attacks. Strokes are trickier than heart at
tacks because the thrombolyticsmust be given
much sooner (within 3 rather than 12 hours)
and therisk of thrombolytic-related intracranial
hemorrhage ismuch greater (probably 6 or 7
percent instead of 1 percent).
Earlier clinical trials had shown that t-PA
did not yield any overall benefit if itwas ad
rrtinisteredmore than three hours after the pa
tient firsthad symptoms of a stroke. This short
window of opportunity meant t-PAwas given
to fewer than 5 percent of stroke patients. To
harm.
Estragon
72 years old
Vladimir
52 years old
mm
ewegency
diabetic
historyof stroke
stable vitalsigns but
highblood pressure
rapidpulse
low blood
pressure
electrocardiogram
indicates a massive
anterior-wall
no co-morbid
chronic
illnesses
heart
electrocardiogram
indicates a small
attack
inferior-wall heart
attack
mortality risk25%
more likelytobenefit
from treatment
mortalityrisk2%
less likely
to benefit;might
Ibe harmedby treatment
Figure 3. The common practice of treating everyone just tomake sure patients who will
if a treatment carries a risk of harm for some patients.
benefit get treated is dangerous
Two hypothetical
heart-attack patients illustrate this situation. Estragon is very ill and
would
benefit from t-PA, a clot-busting
probably
drug that performed better than
a patient at low risk of
in a randomized
clinical trial. Vladimir,
Streptokinase
dying
from his heart attack, might actually be harmed by themore potent agent, particularly
ifhe has risks for bleeding,
such as high blood pressure or a prior stroke.
www.americanscientist.org
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
January-February
63
Ail
Bal
overalloutcomes inATLANTIS
trial
s
2i
co
o "3 50
e o
Si
modified
Rankin
outcomes in low-risk
group inATLANTIS
I."... i control
experimental
trial
low-risk subgroup
benefitedfromtreatment
?
harmed
?
by treatment
trial looked at the outcome of stroke patients treated three to five hours after the onset of symptoms, slightly longer than
Figure 4. The atlantis
the recommended
cut-off. The collective results (top graph) indicated patients in the experimental
group did no better than those in the control
the authors looked at the outcomes among the one-third of patients at least risk of hemorrhage,
group, who were given a placebo. But when
they
found that they were helped by t-PA (bottom graph). In the initial analysis the benefit reaped by some was masked
by the harm others suffered.
that identify those patients least likely to suffer a treatment-related hemorrhage
based on pretreatment characteristics
(right) could
for t-PA to be extended in appropriate
situations. (Graph adapted from Kent, Ruthazer and Selker 2003.)
potentially allow the treatment window
Risk models
American
Scientist, Volume
that
one-variable-at-a-time
analysis
is
95
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
2007
www.americanscientist.org
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
January-February
65
scores reported for clinical trials can be niisleading, and trials are
physicians are aware that the single-number
one
or
to
the
of
exarrtine
another attribute or risk factor on the effectof an intervention.
Most
usually analyzed
impact
But these single-factor analyses are much less likely to yield useful insights than ismultifactor risk stratification.
The difference in the power of these two types of analyses is demonstrated by a clinical-trial simulation. The virtual
patients in this simulation could have any of six differentrisk factors, each of which increased the risk of a bad outcome
by a factor of two. The prevalence of the risk factors varied from 10 percent to 40 percent. Treatment decreased the rela
tive risk of the outcome (a negative event used tomeasure treatment efficacy) at the five-yearmark by 50 percent. But it
also led to three bad outcomes per 1,000 patients per year.
On thewhole the treatment benefited the virtual population: The treatment-effect,expressed as the relative risk reduc
tion, was 23 percent. The statistical power of the simulated trial (A) was also roughly similar to those ofmany real trials.
Given the degree of benefit and the number of patients in the trial, there is a 74 percent chance a single run of the trial
would have a statistically significant result.
true
control
event rate
(outcome
risk)
risk-factor
distributor
randomizer
overall
results
5.6%
true
relative risk
reduction
(treatment
statistical
power
effect)
23%
0.74
When the six risk factorswere used one by one to divide the patients into two groups (B), the treatment-effectdidn't
vary substantially among the groups. A single risk factordidn't much change the patients' risk of suffering the outcome,
and the presence of other risk factors helped obscure what impact itdid have.
are relativelysmall, itwas unlikely thatany one of these single-factoranalyses
Because thedifferences in the treatment-effect
would have statisticallysignificantresults. Indeed, even ifa risk factorwas present in40 percent of the trialsubjects (bottomrow),
therewas only a 19 percent chance that therewould be a significantdifferencebetween the two groups in the treatment-effect.
risk factor
#1 present
absent
#3 present
absent
in 25%
in 75%
#6 present
single-risk-factor
sorter
in 10% of subjects
in 90%
in40%
in60%
absent
of subjects
of subjects
of subjects
of subjects
of subjects
9.5%
5.1%
8.5%
4.6%
7.8%
4.2%
33%
20%
0.11
30%
18%
0.16
29%
14%
0.19
In contrast,when a risk index or scorewas used todivide patients into subgroups (C),we saw tremendous variation both
in the outcome risk and the treatment-effect.
The lowest-riskgroup had an outcome risk of 1.5 percent; thehighest-risk group
The
had a risk of 23.1 percent Because of thevariation inoutcome risk, therewas also largevariation in the treatment-effect.
treatment-effectreversed sign forthe lowest-riskgroup. Their likelihood of sufferinga bad outcome increased49 percent (al
a
a
though this corresponds to just small absolute increase in risk).The highest-risk group had 40 percent relative risk reduction
with treatment.Given the large variation in treatment-effect,theanalysis isquite powerful. The chance of finding a statistically
as finding a treatment-effectin the overall trial.
significantdifference between risk strata is almost as high
4-6
risk factors
3 risk factors
2 risk factors
1 risk factor
riskmodel
0 risk factors
If the six factors are used to assign each patient a risk score and the patients are then divided into two groups of
roughly equal size based on their scores (D), the variation in the outcome risk and the treatment-effectremains large
and the statistical power relatively high. Together
c
these simulations demonstrate thatmore realistic risk
9.0%
2 or more risk factors
0.58
to
treatment
is
the
decisions
analysis
likely
improve
2.6%
0-1 risk factors
physicians and patients must make.
66
American
Scientist,
Volume
95
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
Viola
Cesario
more
than three
months ago:
transient loss of
vision inone eye
no symptoms since
cerebral angiogram
showed a 90%
blockage, with a
smooth plaque
causing a 70%
stenosis
highly irregular
border
ofhighblood
no history of high
blood pressure and
peripheral vascular
disease
pressure and
peripheral vascular
disease
should
should be treated
not be treated
baseline
would
men
stood
actually
50-69% stenosis
smooth
stenosis
to benefit
70-99%
smooth
stenosis
ulcerated/
irregular
stenosis
ulcerated/
irregular
5-year
riskof
stroke
(percent)
stroke
<10
TIA
10-15
ocular
15-20
-r V
timesince lastevent (weeks)
women
r \/
20-25
50-69% stenosis
smooth
stenosis
?> -7N
\/ni ?>
70-99%
smooth
stenosis
ulcerated/
irregular
25-30
30-35
stenosis
ulcerated/
]35-40
irregular
140-^5
I45-50
|>50
stroke
TIA
ocular
V t>
-7N\/ ni >
timesince lastevent (weeks)
7. Following
the reanalysis
of the trial described
that a chart like that shown above might make
suggested
to physicians.
The chart presents
analysis more accessible
Figure
on five patient
of the patient's most
stroke based
nature
age, amount
of stenosis
characteristics
severe
in addition
recent event
(a TIA
(Adapted
is a transient
from Rothwell
2007
www.americanscientist.org
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions
ischemie
attack),
et al. 2005.)
January-February
67
links,
this issue of
American
Scientist
Online:
http://www.american
scientist.org/
lssueTQ?/issue/921
68
American
Scientist, Volume
Ioannidis,
50(10):1089-1098.
Epidemiology
of the base
Ioannidis, J.R, and J.Lau. 1998. Heterogeneity
line risk within patient populations
of clinical trials:
a
evaluation
algorithm. American Journal of
proposed
148(11):1117-1126.
Epidemiology
infarction who
for selecting patients with myocardial
are
activa
likely to benefit from tissue plasminogen
tor compared with Streptokinase. American Journal of
Medicine
113:104-111.
The Lancet.
P.
S. A. Gutnikov
.,Z. Mehta, S. C. Howard,
P. Warlow.
2005. Treating individuals 3: from
ex
to individuals: general
subgroups
principles and the
ample of carotid endarterectomy. The Lancet Qan 15-21)
Rothwell,
and C
365(9455):256-265.
95
This content downloaded from 144.173.232.36 on Tue, 08 Sep 2015 11:04:52 UTC
All use subject to JSTOR Terms and Conditions