Beruflich Dokumente
Kultur Dokumente
OF
DETECTION
OF
Helen A. Clare, MAppSc,a Roger Adams, PhD,b and Christopher G. Maher, PhDc
ABSTRACT
Background and Purpose: The poor reliability of lateral shift detection has been attributed to lack of rater
training, biologic variation, and test reactivity. This study aimed to remove the potential confounding arising from
biological variation and test reactivity and control the level of rater experience/training in making judgments of lateral
shift.
Subjects: One hundred forty-eight raters with 3 levels of clinical physical therapy experience and training in the
McKenzie method participated.
Method: The raters viewed photographic slides of 45 patients with low back pain. Slides were judged on a
numerical scale for presence and direction of a shift. Intrarater reliability was evaluated using the intraclass
correlation coefficient (ICC) and interrater reliability was evaluated using both the ICC and statistic.
Results: Reliability of shift judgments was only moderate for all groups (eg, ICC [2,1] values ranged from 0.48 to
0.64).
Conclusion: Lateral shift judgements have only moderate reliability, even when trained raters judge stable stimuli.
We propose that the photo model employed can be used to explore the source of error in this process. (J
Manipulative Physiol Ther 2003;26:476-80)
Key Indexing Terms: Low Back Pain; Lumbar Spine; Lateral Shift; Reliability of Testing; McKenzie Method
INTRODUCTION
recent survey of physical therapists in the United
States1 reported that a McKenzie evaluation was
one of the most common evaluations performed for
patients with low back pain (LBP) and that almost half the
therapists viewed the McKenzie method as the most useful
management approach for low back pain. Similar results
have been reported for British and Irish physiotherapists.2,3
The method has received support as an effective LBP
treatment in a systematic review of activity prescription for
back pain4 and also in Danish clinical practice guidelines,5
based on the 2 existing clinical trials.6,7 Subsequent to the
completion of both reviews, Cherkin et al8 published a
clinical trial evaluating 3 approaches (chiropractic manipulation, McKenzie therapy, and an educational booklet) and
476
METHOD
Project Overview
The design of the experiment required raters to inspect a
set of photographic slides of patients with low back pain and
to judge whether a shift was present. The photographs of the
patients had been taken by the first author on the same day
that she performed a full clinical examination of these
patients. On the same visit. demographic and clinical data
were recorded for each patient.
Subjects
Patients with low back pain. Patients attending a private physiotherapy clinic for low back pain were invited to participate
in the study. The criteria for inclusion were that they were
currently experiencing low back pain with or without radiation to the leg.
All subjects gave written consent prior to participating.
Information was collected from the subjects regarding their
gender, age, weight, height, location of symptoms, duration
of symptoms, working status, previous history of LBP, pain
intensity, frequency, and functional status (Table 1).
Raters. The raters consisted of:
60 first-year undergraduate physical therapy students
with no clinical experience or training in the McKenzie
method.
477
478
45
50.6 (14)
164 (12)
73 (15)
5.6 (1.7)
47.3 (19)
58%
87%
51%
18%
31%
51%
56%
27%
44%
Data for continuous variables are mean values with SDs in parentheses,
categorical variables are percentages.
46 graduate physical therapists with some clinical experience but no formal training in the McKenzie
method.
42 graduate physical therapists who had clinical experience and had completed a minimum of 70 hours of
formal training in the McKenzie method.
Procedure
Investigator HC conducted a complete clinical examination of each patient and then asked the patient to stand
within a doorway with their back toward a camera (Cannon
EOS 3000 88, Tokyo, Japan). The camera was placed on a
tripod 3 meters from the subjects, set on auto focus, and was
able to be activated from a distance. A photograph was then
immediately taken. The patient then resumed their normal
treatment.
The photographs were converted into slides and duplicates were made, which resulted in 90 slides of the 45
patients. These were randomly positioned in a slide tray so
that the order of the second set of slides varied from the first
set. The slides were shown to the 3 sets of raters. The
instructions given to the raters were that they were to
determine the presence or not of a lumbar lateral shift. They
were read the following:
McKenzie 1981 defines a lateral shift as when the top half of
the patients body has moved laterally in relation to the bottom
half.
Data Analysis
Reliability of detecting a shift. The raters judgements were converted to a 5-point scale of confidence that the patient had a
right shift: 2 certain shifted to the left; 1 uncertain
shifted to the left; 0 neutral; 1 uncertain shifted to the
right; 2 certain shifted to the right. Intrarater reliability
was determined by comparing the judgments of lateral shifts
of the first presentation of the 45 subjects with the second
presentation. This was performed for all raters.
Intrarater and interrater reliability were evaluated by calculating the intraclass correlation coefficient (ICC [2,1]) for
each group, using the SPSS Macro ICCSF2.SPS within
SPSS 10.0 (SPSS, Chicago, Ill). This analysis considers the
data as continuous data, whereas others may consider the
data to represent ordinal data. However, the argument is
unnecessary, as Fleiss and Cohen18 have shown that
weighted and the ICC are equivalent. To allow comparison with other studies that have evaluated reliability with ,
we calculated the multirater (an unweighted form of )
using the MKAPPASC.SPS macro in SPSS 10.0.
The intrarater reliability (ICC 2,1) for each subject was
determined and then a group mean value and 95% CI for the
group was determined. This was done for each of the 3
groups. The interrater reliability (as determined by the ICC
and ) were calculated for each of the 3 groups. Ninety-five
percent CIs for each statistic were calculated.
RESULTS
Intrarater reliabilities, as expressed by ICC values with
95% CIs, are shown in Table 2. The ICC values ranged from
0.48 to 0.59, which falls within the range of ICC values
described by Fleiss19 as representing fair to good reliability.
The interrater reliabilities, expressed as ICC values, ranged
from 0.49 to 0.64, again in the range representing fair to
good reliability. For both intrarater and interrater reliability,
inspection of the 95% CIs reveals that the McKenzie group
had statistically significantly greater reliability than the
other groups. The (Table 2) ranged from 0.26 to 0.38,
again suggesting fair reliability.20
DISCUSSION
Despite using a simplified model of clinical practice that
removed any potential for reactivity and biologic variation,
the reliability of shift detection remained unacceptably low.
While the McKenzie trained raters were more reliable in
judging a shift than the other 2 groups of raters, the absolute
difference between groups was small and was revealed as
statistically significant because of the high power of the
Intrarater*
Raters
ICC
ICC
Kappa
First-year students
Graduate physical therapist
McKenzie trained physical therapists
0.56 (0.53-0.59)
0.48 (0.43-0.53)
0.59 (0.55-0.63)
0.53 (0.46-0.61)
0.49 (0.42-0.51)
0.64 (0.57-0.71)
0.36 (0.35-0.37)
0.26 (0.25-0.27)
0.38 (0.37-0.39)
Point estimate and 95% CI for a single ICC or Kappa that compares multiple raters.
CONCLUSION
Despite the task of judging the presence or absence of a
lateral shift being simplified by the removal of biologic
variation and test reactivity, the reliability of the raters in
this study was unacceptable. We recommend that this model
utilizing photographs of LBP patients be used to further
study the features of the lateral shift that influence the raters
decision as to its presence and direction. Once these have been
ACKNOWLEDGMENTS
This study was approved by the Human Research Ethics
Committee of the University of Sydney.
REFERENCES
1. Battie MC, Cherkin DC, Dunn R, Ciol MA, Wheeler K.
Managing low-back pain: attitudes and treatment preferences
of physical therapists. Phys Ther 1994;74:219-26.
2. Foster NE, Thompson KA, Baxter GD, Allen JM. Management of nonspecific low-back pain by physiotherapists in
Britain and Ireland. Spine 1999;24:1332-42.
3. Hurly DA, Dusior TE, McDonough SM, Moore AP, Linton
SJ, Baxter GD. Biopsychosocial screening questionnaire for
patients with low-back pain: preliminary report of utility in
physiotherapy practice in Northern Ireland. Clin J Pain 2000;
16:214-28.
4. Maher C, Latimer J, Refshauge K. Prescription of activity for
low-back pain: what works? Aust J Physiother 1999;45:12132.
5. Danish Institute for Health and Technology Assessment. Lowback pain. Frequency, management and prevention from an
HITA perspective. Danish Health Technol Assess 1999;1:1106.
6. Stankovic R, Johnell O. Conservative treatment of acute lowback pain. A 5-year follow-up study of two methods of treatment. Spine 1995;20:469-72.
7. Nwuga G, Nwuga V. Relative therapeutic efficacy of the
Williams and McKenzie protocols in back pain management.
Physiother Pract 1985;1:99-105.
8. Cherkin D, Deyo R, Battie M, Street J, Barlow W. A comparison of physical therapy, chiropractic manipulation, and
provision of an educational booklet for the treatment of patients with low-back pain. N Engl J Med 1998;339:1021-29.
9. McKenzie RA. The lumbar spine: mechanical diagnosis and
therapy. Waikanae, New Zealand: Spinal Publication Limited;
1981. p. 24-6.
10. Porter RW, Miller CG. Back pain and trunk list. Spine 1986;
11:596-600.
11. Riddle DL, Rothstein JM. Intertester reliability of McKenzies
classifications of the syndrome types present in patients with
low-back pain. Spine 1993;18:1333-44.
12. Donahue MS, Riddle DL, Sulivan MS. Intertester reliability of
a modified version of McKenzies lateral shift assessments
obtained on patients with low-back pain. Phys Ther 1996;76:
706-16.
479
480