Beruflich Dokumente
Kultur Dokumente
Methodological note
a
Pole reeducation-readaptation, CHU de Dijon, 23, rue Gaffarel, 21000 Dijon, France
Inserm U887, faculte des sciences du sport, universite de Bourgogne, campus universitaire Montmuzard, 21078 Dijon, France
c
CIC-P Inserm 803, 23, rue Gaffarel, 21000 Dijon, France
d
UMR UJF CNRS 5525, laboratoire TIMC-IMAG, clinique MPR, institut de reeducation, hopital Sud,
CHU de Grenoble, BP 338, 38434 Echirolles cedex, France
e
Service de reeducation neurologique, centre helio-marin, CHU de Nmes, 30240 Le Grau-du-Roi, France
f
Laboratoire depidemiologie et de biostatistiques, IURC, 75, rue de la Cardonille, 34093 Montpellier, France
Abstract
Background. Many clinical scales contain items that are scored separately prior to being compiled into a single score. However, if the items
have different degrees of importance, they should be weighted differently before being compiled. The principal aims of this study were to show how
the analytic hierarchy process (AHP), which has never been used for this purpose, can be applied to weighting the six items of the London
handicap scale, and to compare the AHP to the conjoint analysis (CA), which was previously implemented by Harwood et al. (1994) [1].
Design. In order to assess the relative importance of the six items, we submitted AHP and CA to a group of 10 physiatrists. We compared the
methods in terms of item ranking according to importance, assessment of fictitious patients based on weights determined by each method, and
perceived difficulty by the physiatrist.
Results. For both techniques, Physical independence (PHY) was the best-weighted item, but other ranks varied depending on the
technique. AHP was better than CA in terms of accuracy (global assessment of the clinical status) and perceived difficulty.
Conclusion. AHP may be used to reveal the importance that experts assign to the items of a multidimensional scale, and to calculate the
appropriate weights for specific items. For this purpose, AHP seems to be more accurate than CA.
# 2010 Elsevier Masson SAS. All rights reserved.
Keywords: Outcome assessment (health care); Validation studies; Scoring methods; Decision theory
Resume
Position du proble`me. Les items dune echelle clinique multidimensionnelle sont habituellement cotes separement avant detre simplement
additionnes pour former le score total. Toutefois, avant cette addition, ils devraient theoriquement etre ponderes en fonction de leur importance
respective. Les objectifs du present travail etaient : (1) de montrer que la methode du processus de hierarchie analytique (PHA), jamais utilisee
dans ce cadre, permet daffecter des poids aux six items de la London Handicap Scale en fonction de leur importance, et (2) de la comparer a`
lAnalyse Conjointe (AC) precedemment utilisee par Harwood et al. (1994) [1].
Methode. Afin devaluer limportance relative des items, nous avons soumis dix medecins specialistes en medecine physique et readaptation
aux deux methodes PHA et CA. Nous avons compare les deux techniques en ce qui concerne le classement ditems par ordre dimportance,
levaluation du statut de patients fictifs avec utilisation des poids calcules et la difficulte percue par les medecins de mise en uvre de la technique.
Resultats. Pour les deux methodes, litem independance physique etait le mieux pondere, mais les autres etaient ponderes differemment
selon la technique utilisee. PHA etait superieure a` CA en termes de precision pour levaluation globale du patient et de facilite de mise en uvre.
* Corresponding author.
E-mail address: charles.benaim@chu-dijon.fr (C. Benam).
0398-7620/$ see front matter # 2010 Elsevier Masson SAS. All rights reserved.
doi:10.1016/j.respe.2009.09.004
60
Conclusion. La methode du PHA peut etre employee pour evaluer limportance accordee a` des experts aux dimensions dune echelle
multidimensionnelle et pour calculer les poids des items. Dans ce cadre, PHA semble superieure a` CA.
# 2010 Elsevier Masson SAS. Tous droits reserves.
Mots cles : Evaluation ; Etude de validation ; Methode de scorage ; Theorie de la decision
1. Introduction
Health measurement is a fundamental issue in medicine, and
measurement scales are increasingly used by clinicians to assess
patients health. In Pubmed database, we identified a considerable increase in articles devoted to scale validation (key words
scale and validation): 34 articles in 1988, 152 in 1998, and
1288 in 2008. Most measurement scales comprise several ordinal
items, which are summed in order to produce a single score.
Consequently, all items have the same mathematical weight.
However, when an item is more important than another, its
weight should be increased accordingly. For instance, consider a
two-dimensional scale made up of two items scored 0 to 5:
mobility (MOB) and physical independence (PHY). If
MOB is twice PHY, then the handicap score should be
H = 2 MOB + PHY instead of H = MOB + PHY, or MOB
should be changed to an item scored 0 to 10.
To date, only few authors have questioned the way in which
item weights should be determined in a multi-item scale and have
come up with solutions taken from non-medical literature.
Harwood et al. used the technique of conjoint analysis (CA) to
develop a 6-item handicap measurement [1]. They built a matrix
of scale weights (or part utilities) relating to the different
levels of each item, the overall handicap score of a patient being
the sum of the scale weights corresponding to the clinical status.
Many other techniques have been developed to reveal the
importance which experts give to criteria, which are known as
criteria-weighting techniques. The analytic hierarchy process
(AHP) is one of the most widely recognized criteria-weighting
techniques in the field of multiple-criteria decision-making [2]. It
is based on the construction of a matrix diagramming the
comparisons of the criteria to one another (corresponding to the
preferences of the decision maker, from which criteria weights
are derived). AHP has been used in a variety of contexts, ranging
from strategic planning to solving international conflicts [3]. It
has also been successfully used in medical decision making [4
6], involving patients in decisions about their care [7], hospital
management problems [8] and problems of selecting medical
staff [9], but we do not have knowledge concerning the use of
AHP to calculate the weights of clinical items.
The principal aims of the present study were:
to show how AHP, which has never been used for this
purpose, can be used to weight the items of a multi-item scale;
to compare AHP with CA.
2. Methods
We choose the so-called London handicap scale as an
example of a multi-item scale. It is made up of six ordinal items:
Table 1
An example of reciprocal pairwise comparison matrix filled during physiatrist
interview.
MOB
PHY
OCC
SOC
ORI
ECO
MOB
PHY
OCC
SOC
ORI
ECO
1
1/2
1/5
1/7
1
1/9
2
1
1/3
1/6
3
1/8
5
3
1
1/3
7
1/5
7
6
3
1
7
1/3
1
1/3
1/7
1/7
1
1/9
9a
8
5
3
9
1
61
Table 2
An example of fictitious profile to be assessed by the physiatrists (5,5,3,3,1,1).
Item range 15.
Item
Description
Mobility
Physical independence
Occupation
Social integration
Orientation
Economic self sufficiency
Table 3
Means of scale weights associated with item levels.
Items
Mobility
Physical independence
Occupation
Social integration
Orientation
Economic self sufficiency
Level 1
Level 2
Level 3
Level 4
Level 5
Importance
0.43
0.06
0.82
0.18
0.54
1.17
5.29
1.47
1.94
3.49
3.06
4.71
10.14
2.88
4.71
7.17
6.66
8.24
10.99
11.22
11.62
11.27
11.44
10.48
11.83
25.32
18.53
15.38
16.22
12.73
11.40
25.38
19.35
15.56
16.77
11.55
Values have been re-scaled in order to obtain calculated profile scores ranging from 0 to 100. Importance = level 5 level 1.
62
Table 4
Concordance between the 10 binary choices and methods CA and AHP for the 10 physiatrists (Kappa coefficients).
Methods
CA
AHP
Physiatrists
CF
PG
RE
VC
AA
0.19
0.40
0.42
0.36
0.57
0.57
0.55
0.31
0.07
1.00
EC
0.03
0.74
BM
0.15
1.00
JP
0.03
0.74
AB
0.03
0.18
SL
0.40
0.21
the absence of differences between the methods, as it is wellknown that the smaller the number of criteria, the easier the
weight estimation, and the smaller the differences between
methods [16].
In this study, the importance of items was quite different
from the one calculated by Harwood et al., using CA with the
same handicap items (in a 6-level version) [1]. We found PHY
to be the item with the highest importance, while Harwood
found ECO to be the most important, followed by ORI, MOB,
PHY, OCC and SOC. This difference is due to the different
origins of populations included in the studies (physiatrists vs
general population), and remind us that CA (or any criteriaweighting technique) is not intended for estimating the real
scale weights of a multi-dimensional scale, but to reveal
preferences of the subjects interviewed about the scale. As a
consequence, the weights calculated above are (exclusively)
suited to an objective measurement of a patients status by
physiatrists. For another purpose, such as a self-assessment
made by consumers, a new set of weights should be estimated.
Stineman et al. found such a difference in the assessment of
functional states, clinicians placing greater value on cognitive
skills than do consumers [15]. As stressed in the new internal
classification of functioning, disability and health, the
patients environment has to be taken into account and is thus
another possible major determinant of item weights. For
example, the ability to climb stairs is crucial for a stroke patient
living at home without an elevator, while it is less important for
a patient living in a single-storey house. The item climbing
stairs (or the level able to climb stairs in a MOB item)
should then be weighted accordingly.
Although it seems obvious to us that weighting items is
essential prior to using a multi-item scale in a particular
context, it is also true that weighting techniques are not always
easy to implement. In the present study, we submitted to experts
15 pairwise comparisons (AHP) and 18 fictitious profiles (CA)
to assess the weights of six items. The interview conditions
were acceptable, but they would have been very difficult with a
greater number of items. For instance, the number of pairwise
comparisons (AHP) should have been 45 for a 10-item scale,
which is nearly impossible to implement. A large number of
items is thus a serious limitation for criteria-weighting
techniques. In such cases, grouping items into a small number
of homogeneous clusters prior to applying weighting techniques to the clusters may be an acceptable solution.
Validity and reliability are the main qualities to be assessed for
the validation of a multi-dimensional clinical scale [17].
However, item weighting should also be an important stage in
the validation process, since items will be aggregated into a
single global score. This can be done either along with the
63
[9] Weingarten MS, Erlich F, Nydick RL, Liberatore MJ. A pilot study of the
use of the analytic hierarchy process for the selection of surgery residents.
Acad Med 1997;72:4002.
[10] World Health Organization. International classification of impairments,
disabilities and handicaps. Geneva: WHO, 1980.
[11] Schoemaker P, Carter Waid C. An experimental comparison of different
approaches to determining weights in additive utility models. Mgmt Sci
1982;28:18296.
[12] NCSS: Number cruncher statistical system (Computer Program), version
97. Kaysville, Utah; 1997 (program).
[13] Matlab (Computer program), version 5.1. USA;1997 (program).
[14] Mousseau V. Analyse et classification de la litterature traitant de limportance relative des crite`res en aide multicrite`re a` la decision. Rech Oper
1992;26:36789.
[15] Stineman M, Maislin G, Nosek M, Fiedler R. Comparing consumer and
clinician values for alternative functional states: application of a new
feature trade-off consensus building tool. Arch Phys Med Rehabil 1998;
79:15229.
[16] Adelman L, Sticha P, Donnell M. The role of task properties in determining the relative effectiveness of multi-attribute weighting techniques.
Organiz Behav Hum Perf 1984;33:24362.
[17] McDowell I, Newell C. Measuring health: a guide to rating scales and
questionnaires. New York: Oxford University Press; 1987.
[18] Fisch H, Hammond K, Joyce C, OReilly M. An experimental study of the
clinical judgment of general physicians in evaluating and prescribing for
depression. Brit J Psychiat 1981;138:1009.
[19] OBoyle C, McGee H, Hickey A, OMalley K, Joyce C. Individual quality
of life in patients undergoing hip replacement. Lancet 1992;339:108891.
[20] Wainer H. Estimating coefficients in linear models: it dont make no never
mind. Psychological Bull 1976;83:2137.
[21] Eckenrode R. Weighting multiple criteria. Mgmt Sci 1965;12:18091.
[22] Nutt C. Comparing methods for weighting decision criteria. Omega
1980;8:16372.
[23] Belton V. A comparison of the analytic hierarchy process and a simple
multi-attribute value function. EJOR 1986;26:721.
[24] Borcherding K, Eppel T, von Winterfeld D. Comparisons of weighting
judgments in multiattribute utility measurements. Mgmt Sci 1991;37:
160319.
[25] Olson D, Moshkovich H, Schellenberg R, Mechitov A. Consistency and
accuracy in decision aids: Experiments with four multiattribute systems.
Decis Sci 1996;26:72348.
[26] Weber M, Borcherding K. Behavioral influences on weight judgments in
multiattribute decision making. EJOR 1993;67:112.
[27] Roy B. Multicriteria methodology for decision aiding. Dordrecht: Kluwer
Academic; 1996.