Concurrent Validity and Reliability of Clinical Evaluation of The Single Leg Squat

Physiotherapy Theory and Practice, 27(8):586–594, 2011
Copyright © Informa Healthcare USA, Inc.

ISSN: 0959-3985 print/1532-5040 online
DOI: 10.3109/09593985.2011.552539
CLINICAL TECHNICAL NOTE
Concurrent validity and reliability of clinical evaluation

of the single leg squat
Daniel R. Poulsen, PT, PhD, MA, OCS and C. Roger James, PhD
Assistant Professor, Center for Rehabilitation Research, Texas Tech University Health, Sciences Center, Lubbock, Texas,
USA
Physiother Theory Pract Downloaded from informahealthcare.com by Michigan University on 11/05/14
ABSTRACT
This study determined reliability and concurrent validity of measurements of the single leg squat made by novice
examiners. Twelve video recordings of individuals performing a single leg squat were evaluated by six student
physical therapists. Students assessed movement quality on an ordinal scale and manually measured frontal
plane knee movement quantity on a video monitor. Inter- and intrarater reliability of ordinal scale ratings were
determined via quadratically weighted kappa. Inter- and intrarater reliability of frontal plane knee measures
were determined through intraclass correlation coefficient models 2,k and 3,k (k = 3 ratings), respectively.
Concurrent validity of frontal plane knee measures was examined by comparison with Vicon-Peak motion-
tracking system measures via Bland-Altman scatterplots. Ordinal scale measures displayed intrarater reliability
ranging from 0.38 to 0.94 and interrater reliability of 0.68 (0.46–0.87). Intrarater reliability of frontal plane knee
For personal use only.
measures ranged from 0.88 to 0.98 and interrater reliability of 0.99 (0.97–1.00). Difference scores between
student and computer-generated measures of frontal plane knee movement were significantly different as
determined through Bland-Altman scatterplots and calculation of the upper and lower limits of agreement.
INTRODUCTION setting that can measure patient movement during the

performance of functional tasks.
It has been proposed that functional movements, such The frontal plane projection angle (FPPA) is a two-
as the single leg squat, can be measured to assess pre- dimensional (2D) knee angle measure that is manually
disposition to common degenerative and traumatic performed by using software or traditional angle-
musculoskeletal injuries (Chmielewski et al, 2007; measuring devices, such as a goniometer, on a compu-
Hewett et al, 2005; McLean, Huang, and vanden ter monitor (Willson and Davis, 2008a). Knee valgus
Bogert, 2005; Willson and Davis, 2007). For angle during weight-bearing tasks, as measured
example, evidence indicates that excessive valgus through FPPA, was shown to be greater in female sub-
angle at the knee during functional tasks is a risk jects diagnosed with patellofemoral pain syndrome
factor for noncontact anterior cruciate ligament than healthy control subjects (Willson and Davis,
injury (Griffin et al, 2006). Because the measurement 2007; Willson and Davis, 2008b). Despite the early
methods used to detect abnormal movement patterns promise of the FPPA as a useful clinical tool, a
during functional activities are expensive and techni- review of the literature indicates an absence of evi-
cally sophisticated, it has been difficult to replicate dence about the reliability and validity of measures
these types of movement analyses in the clinical made by novice clinicians when using this measure-
setting. Therefore, attempts have been made to bring ment method.
more cost-effective and pragmatic tools to the clinical Chmielewski et al (2007) used an ordinal scale for
the measurement of quality of motion displayed
during performance of single leg activities. This
group found intrarater reliability of experienced clini-
Accepted for publication 19 November 2010 cians, as measured by weighted kappa coefficients, to
range between 0.38 and 0.68 when used to assess
Address correspondence to Daniel R. Poulsen, Center for Rehabilitation
Research, Texas Tech University Health, Sciences Center, 3601 4th lateral step down (LSD) and single leg squat (SLS)
Street, Lubbock, TX 79430-6280. E-mail: dpoulsen@rocketmail.com quality of motion. It was suggested in the same
586
Physiotherapy Theory and Practice 587
report that the relatively low reliability might have been measures. Concurrent validity of student-generated
due to a lack of heterogeneity in the subjects being as- FPPA measures was determined by comparison to cri-
sessed and the use of a nonintuitive rating system that terion standard values generated by a computerized
consisted of a set of arbitrary symbols. Chmielewski motion-tracking system.
et al (2007) concluded that this method of assessment
can be used by clinicians to produce ratings of
movement with reliability greater than would be Subjects performing single leg squat
found by chance, but less than what is necessary for
clinical use.
Twenty-two individuals were recruited from a sample
These newer methods for assessing movement hold of convenience at a local university through personal
promise as useful clinical tools for measuring patient
communication. A solicitation script was used to
motion while performing a functional task. However,
recruit these individuals. The inclusion criterion was
a review of literature indicates that nothing is known
age of 18-60 years. Exclusion criteria consisted of

of the reliability of measures made by novice clinicians
current injury to either lower extremity causing pain
when using the ordinal scale described by Chmielews-
or requiring treatment from a health care practitioner,
ki et al (2007) and the FPPA method of assessment medical history including previous injury to either
described by Willson and Davis (2007). In addition,
lower extremity that required health care services
recent publications mentioned here indicate that auto-
lasting greater than 1 week during the subject’s life-
matic computerized tracking and calculation of the time, self-reported balance disorder, or pregnancy.
knee frontal plane projection angle may be a useful
clinical measure. However, the relationship of hand-
measured FPPA to computer-generated FPPA has
not yet been investigated. If these tools are to be Protocol for obtaining FPPA performance
used as standard methods of movement assessment data of each SLS performance
in physical therapy practice, it is essential that ex-

pected levels of reliability and validity of measures An electronic goniometer (elgon; model LS800 and
made by clinicians be elucidated, particularly in phys- SG150; Biometrics, Ltd., Ladysmith, VA, USA) was
ical therapists with limited clinical experience. secured to the lateral aspect of the thigh and leg of
Therefore, the objectives of the study were to 1) the stance (dominant) limb of each subject to
investigate the intra- and interrater reliability of measure knee flexion angle in real time for the
measures of single leg squat (SLS) movement pro- purpose of standardizing depth of single leg squat.
duced by novice physical therapists when using a The dominant limb was defined as the limb that the
more intuitive numerical ordinal scale; 2) investigate subject would use to kick a ball. The elgon recorded
the intra- and interrater reliability of 2D FPPA (120 Hz) sagittal (flexion-extension) plane knee
measures of SLS performance produced by novice motion. Real-time sagittal plane elgon data were
physical therapists; and 3) investigate the concurrent used as biofeedback to inform the subject when the
validity of 2D knee FPPA measures produced by desired amount of knee flexion (45°) had been
novice physical therapists through comparison with a reached during the SLS.
Vicon-Peak (120 Hz, Motus 9.0; Vicon-Peak, Cen- By using previous works of Willson and Davis
tennial, CO, USA) 2D computerized movement (2008a; 2008b), a method of capturing 2D knee
tracking system which also produces a 2D measure motion was devised to create measures of the FPPA
of FPPA. It was hypothesized that 1) ordinal scale through a computerized motion-tracking system. Six
and FPPA measures would exhibit levels of intra- reflective markers were applied to the subject, one at
and interrater reliability considered adequate for use each anterior superior iliac spine (ASIS), two on a
in the clinical setting and 2) measures of the FPPA line approximating the mechanical axis of the thigh
would not significantly differ from those of the com- segment, and two on a line approximating the mech-
puterized movement-tracking system. anical axis of the leg segment (Figure 1). Two
additional reflective markers were placed 1 meter
apart on the floor to create a horizontal reference
MATERIALS AND METHODS line of known length used to verify the 2D scaling. Re-
flective markers were automatically tracked through a
Experimental design computerized motion-tracking system (120 Hz,
Motus 9.0; Vicon-Peak, Centennial, CO, USA) to de-
A test-retest design was used to examine the intra- and termine the criterion standard FPPA measure. The
interrater reliability of ordinal scale and FPPA raw x- and y-coordinate data were scaled by using a
Physiotherapy Theory and Practice

588 Poulsen et al.
Following a short instruction and demonstration by

the principal investigator, the subject was asked to
practice the SLS maneuver. After the subject felt com-
fortable with performing the maneuver, he or she then
performed five repetitions (trials) of the SLS. All five
trials were subsequently reviewed by the principal
investigator, and one video file was chosen for later
analysis by the novice clinician raters. Depth of squat
was standardized by using the elgon, which informed
the subject and investigator when knee flexion
reached 45°. Knee flexion of 45° was used to approxi-
mate a typical attainable squat depth commonly used
during functional activities. The rate of SLS motion
was controlled by instructing the patient to descend

to 45° of knee flexion over a minimum 1 second
period and returning to upright position over a
FIGURE 1 Reflective marker configuration for collecting cri-
minimum 1 second period. The 2-second minimum
terion FPPA measure. Arrows indicate placement of reflective total SLS duration was chosen to minimize effect of
markers. velocity on quality and quantity of movement as well
as rater ability to assess. All squat performances were
performed in a controlled manner and were per-
0.562-meter by 1.072-meter rectangular scaling formed over a period of time equal to or greater than
grid and smoothed by using a fourth-order, two-pass 3.4 seconds as determined by review of elgon data.
no phase shift Butterworth digital filter (6 Hz While performing the SLS, the subject was instructed
cutoff). Elgon data were synced with reflective to maintain an upright trunk posture while the video
marker data automatically through the Vicon-Peak images and elgon data were recorded.
system. This was done to ensure that the criterion
standard FPPA measure was always performed at the
lowest point of knee flexion during the SLS perform-
ance. This timing of the FPPA measure also correlated Protocol for creating single leg squat
with when clinician subjects were instructed to video bank
perform measurement of the FPPA on the video
images. The FPPA was calculated as the angle From the original pool of 22 subject videos, video files
between the midline of the leg and midline of the of 12 individuals performing the SLS test were se-
thigh in the frontal plane. The convention used to lected. These 12 videos were categorized by the inves-
interpret FPPA measures was as follows: a positive tigators on the basis of objective quantitative measures
FPPA indicated 2D medial position of the knee and (digital computerized methods, described previously)
a negative FPPA indicated a 2D lateral position of of their SLS performance derived through computer-
the knee. ized measure of reflective marker motion described
A digital video camera (model DCR-HC40; Sony previously. Approximately one third of the individuals
Electronics, Inc., Oradell, NJ, USA) was positioned displayed the least amount of 2D-medial knee move-
approximately 4.5 meters in front of the subject and ment (FPPA < 5°), one third displayed an average
set to record (30 Hz) a frontal plane view of the amount of 2D-medial knee movement (FPPA ≥ 5° to
SLS. The image was adjusted so that the subject was <10°), and one third displayed the most 2D-medial
visible in at least two thirds of the viewing area when knee movement (FPPA ≥ 10°). The FPPA values
the person was in a neutral standing position. The used to define the categories were based on previous
digital video camera and the infrared motion-tracking research findings that reported normal and abnormal
camera were placed by using a floor reference grid to FPPA values displayed during the SLS maneuver by
ensure that they were both recording in an identical healthy adults (DiMattia et al, 2005; Kralj, Jaeger,
distance and angle perpendicular to the frontal plane and Munih, 1990; McKinley and Horowitz, 1992;
of subjects performing the SLS maneuver. Video Nguyen and Shultz, 2007; Salem, Salinas, and
files captured by the digital video camera were later Harding, 2003; Willson and Davis, 2007; Zeller,
used by novice clinician raters to perform movement McCrory, Kibler, and Uhl, 2003; Zeller et al, 2005).
measures (described below) via the ordinal scale and The neutral category is considered representative of
FPPA methods. the average knee frontal plane movement in the

normal healthy adult population. The 2D-medial knee images that clinician raters analyzed. All video files
movement category individuals have a higher degree were observed on a standard 19-inch flat panel LCD
of 2D knee medial movement relative to the average computer display (UltraSharp-1907FPt, Dell Compu-
healthy adult. The 2D-lateral knee movement cat- ter Inc.). Quick Time Player (7.4 Pro; Apple, Inc.)
egory individuals display greater 2D lateral knee software was used to display video data.
movement relative to the average healthy adult. The
Ordinal scale assessment
aim of creating these categories was to develop a
Immediately after watching each individual video,
bank of patient simulation video files that displayed
the rater was asked to measure the SLS performance
varying levels of movement quality and quantity.
for the subject by using the modified specific quality as-
Former research has cited the lack of heterogeneity
sessment tool (Table 1). The term “modified” is used
of movement displayed as a possible cause of limited
here to describe the scale because the original ordinal
visual assessment reliability (Chmielewski et al,
scale used by Chmielewski et al (2007) did not ask
2007; DiMattia et al, 2005).
the rater to apply numerical values to describe the ob-

served kinematics. Chmielewski’s original scale used
Novice clinician raters a randomly assigned group of nonintuitive symbols
that did not directly reflect a change in magnitude of
A convenience sample of six volunteer physical kinematics displayed by the individual performing the
therapy students entering their final semester of SLS. The modified specific method of assessment re-
entry-level education was asked to participate. This quires the clinician to assign one of four numerical
population was specifically chosen because it rep- values (0, 5, 10, or 15) (Table 1) to each of the three seg-
resented a controlled sample of clinician raters who re- ments of interest (trunk, thigh, and pelvis). The sum of
ceived all didactic lecture, lab, and clinical education the segment scores was used for the reliability calcu-
as physical therapists without bias of postgraduate lation. These numerical values increase in magnitude
functional assessment training. Raters were recruited in direct correlation with the descriptions of increasing
during regular lecture and lab instruction. abnormal movement in the ordinal scale definitions.
Novice clinician raters were not limited to the
number of times they were allowed to view each
Evaluation of SLS performance by novice
video. Each rater viewed each individual video an
clinician raters average of three times.
Each novice clinician rater was asked to observe each Frontal plane projection angle assessment
video and assess movement by using the ordinal Following the ordinal scale measurement of the
scale (Table 1) and to measure each subject’s FPPA SLS performance, novice clinician raters were in-
by using a manual goniometer. Video data collected structed to view the video again and to pause the
through use of the Sony DV Camera provided the video at the time when the subject reached the
lowest position while performing the single leg squat.
While the video was paused, the clinician raters then
TABLE 1 Modified qualitative scoring method for evaluating
measured FPPA three times directly from the video
single leg squat
image using a 15-centimeter manual goniometer (E-
Z Read Manual Goniometer, Jamar Inc.). The
Point value Definition
average of these three ratings was used for all reliability
0 No deviation from neutral alignment calculations. The convention used to describe move-
5 A small-magnitude or barely observable ment with the FPPA was as follows: negative value
movement out of a neutral position and/or indicates a medial knee movement and a positive
moderate frequency of segment oscillation value indicates a lateral knee movement (Willson
10 A moderate-magnitude or marked movement
and Davis, 2007; Willson, Dougherty, Ireland, and
out of a neutral position and/or moderate
frequency of segment oscillation Davis, 2005).
15 Excessive or severe magnitude of movement The novice clinician raters were asked to return 4
out of a neutral position and/or high weeks following the initial measurement session to
frequency of segment oscillation repeat all ordinal scale and FPPA measures. These
data were subsequently entered manually into a
Each anatomical segment (trunk, pelvis, and hips/thigh) being
assessed is given a value that correlates with the observed level of spreadsheet by one of the principal investigators.
movement quality. Simulated patient’s score for quality of Both the principal investigator entering the data and
movement is then derived by summing the scores (Chmielewski the novice clinician raters were blinded to measures
et al, 2007). made at the first data collection session.

590 Poulsen et al.
Statistical analysis TABLE 2 Interrater and intrarater reliability of quality of

motion ordinal scale easures
Descriptive statistics of all measurement variables
Interrater qualitative reliability: Coefficient (95% CI)
(qualitative assessment, FPPA obtained via the com-
puterized motion tracking system, and manually Generalized weighted kappa 0.68 (0.46–0.87)
measured FPPA) were compiled and recorded. Intrarater qualitative reliability:
Weighted kappa
Statistical analysis of ordinal scale measures Subject 1: 0.76∗ (0.36–0.93)
Intra- and interrater reliability of the ordinal scale Subject 2: 0.89∗ (0.66–0.97)
measures were calculated by using quadratically weighted Subject 3: 0.70 (0.23–0.90)
kappa as described by Norman and Streiner (1994). Subject 4: 0.38 (-0.22–0.77)
The interrater reliability analysis for both ordinal Subject 5: 0.85∗ (0.56–0.95)
scale measures and FPPA measures was completed Subject 6: 0.94∗ (0.80–0.98)
by using measurements obtained only at the initial ∗

Reliability exceeds value considered necessary for clinical use
data collection session to control for possible matu- (Portney and Watkins, 2009).
ration and learning effect.
Statistical analysis of FPPA measures
Intrarater reliability was calculated for quantitative TABLE 3 Interrater and intrarater reliability of frontal plane
projection angle measures
FPPA measures by using intraclass correlation coeffi-
cient model 3,3. Interrater reliability was calculated
Interrater FPPA reliability: Coefficient (95% CI)
for FPPA measures by using intraclass correlation
coefficient model 2,3. Choice of intraclass correlation Intraclass correlation (ICC) 2,k 0.99∗ (0.97–1.00)
coefficient models was based on recommendations of Intrarater FPPA reliability:
Portney and Watkins (2009). Intraclass correlation(ICC) 3,k

Concurrent validity of FPPA measurements was as- Subject 1: 0.98∗ (0.94–1.00)
sessed by comparing the average values of the manual Subject 2: 0.98∗ (0.94–1.00)
Subject 3: 0.98∗ (0.92–0.99)
goniometric measurements of FPPA and the compu- Subject 4: 0.88∗ (0.59–0.97)
ter-generated criterion standard measurements of knee Subject 5: 0.97∗ (0.89–0.99)
FPPA via Bland-Altman plots (Bland and Altman, Subject 6: 0.97∗ (0.90–0.99)
1986; Bland and Altman, 2010; Hanneman, 2008). A ∗
5° variation in agreement between clinician measures Reliability meets or exceeds value considered necessary for
clinical use (Portney and Watkins, 2009).
and computer-generated measures was determined to
be acceptable a priori. All statistics were calculated by
Bland-Altman plots indicated that novice clinician
using SPSS v.12 and Microsoft Excel 2007.
generated measures of the FPPA differed significantly
from computer-generated measures of the FPPA.
Examination of Bland-Altman scatterplots (Figures
RESULTS 2, 3, 4, 5, 6, and 7) and calculation of upper
(ULOA) and lower limits of agreement (LLOA) indi-
Ordinal scale visual observational cated that variability in difference scores between clin-
assessments of movement quality ician measures and computer-generated measures did
not fall within the a priori-determined limit of 5°
Intrarater ordinal scale assessment reliability ranged (Table 3). Datum points on the Bland-Altman scatter-
from 0.38 to 0.94 (Table 2). Interrater reliability of plots are generated by plotting the difference between
ordinal scale measures determined via generalized the student’s measure and the computer’s measure
quadratically weighted kappa coefficient with two- (Y) versus the mean of the student’s measure and
sided 95% confidence interval was 0.68 (0.46–0.87). the computer’s measure (X) for each subject.
Frontal plane projection angle measures DISCUSSION
Intrarater reliability of FPPA measures ranged from Ordinal scale assessment of movement
0.88 to 0.98 (Table 3). Interrater reliability of frontal
plane projection angle measures with two-sided 95% Results of this study are mixed for the hypothesis that
confidence interval was 0.99 (0.97–1.00). clinicians with minimal clinical experience can display

FIGURE 2 Bland-Altman scatterplot for clinician subject FIGURE 5 Bland-Altman scatterplot for clinician subject
1. Upper and lower limits of agreement (ULOA & LLOA) are 4. Upper and lower limits of agreement (ULOA & LLOA) are
represented by the upper and lower dashed lines, respectively. represented by the upper and lower dashed lines, respectively.
2. Upper and lower limits of agreement (ULOA & LLOA) are 5. Upper and lower limits of agreement (ULOA & LLOA) are
represented by the upper and lower dashed lines, respectively. represented by the upper and lower dashed lines, respectively.
3. Upper and lower limits of agreement . (ULOA & LLOA) 6. Upper and lower limits of agreement (ULOA & LLOA) are
are represented by the upper and lower dashed lines, represented by the upper and lower dashed lines, respectively.
respectively.
the movement of patients over a period of time with

intrarater reliability of ordinal scale measures con- a degree of intrarater reliability necessary for clinical
sidered adequate for clinical use (≥0.75) (Portney use. However, two of the six novice clinicians dis-
and Watkins, 2009). Our results indicate that a played intrarater reliability that would not support
majority of the novice clinician raters could measure this hypothesis when using the ordinal scale.

592 Poulsen et al.
One possible reason for this mixed result in rater 3) Chmielewski et al (2007) discuss that a limitation of
reliability is lack of a more thorough education of their study was the use of an arbitrary, nonintuitive,
raters on consistent performance of measurement symbol system to rank quality of movement; the
using the ordinal scale. In this study the instruction current study used a more intuitive numerical
provided to raters prior to the initial rating session ranking system with equidistant measures that may
was only a brief familiarization to the ordinal scale, have contributed to a more accurate representation
which described the scales purpose and instruction of novice clinician perception of movement.
to apply one definition from the scale to each body
segment. At this instruction period, raters were not
given a standard interpretation of the ordinal scale; it Quantitative measures
was left to each individual rater to interpret the defi-
nition of each level of the ordinal scale leaving room Results suggest that a novice clinician who desires to
for variation in the application of the ordinal scale in- track an individual patient’s FPPA can do so with a
strument when rating subjects. Intrarater reliability degree of reliability that exceeds the minimal clinical
may improve, and exceed an acceptable level, if all standard. All novice clinician quantitative measures
novice clinician raters are instructed in what each exhibited intrarater reliability that was above the level
ordinal scale definition looks like through a prelimi- considered adequate for clinical use (≥0.75) (Portney
nary clinical education that includes examples of and Watkins, 2009). However, examination of
varying levels of stability during a single leg squat. measures of FPPA does not support the hypothesis
Education of this type might allow an opportunity that measures made by novice physical therapists are
for students to develop a static memory of the standard similar to computer-generated criterion standard
method of application of the ordinal scale. Building FPPA measures.
such a memory of consistent interpretation and appli- Because clinician measures of FPPA are reliable,
cation of this assessment method might also result in a both within raters and between raters, there is in-
more reliable measurement from one rating session to creased confidence that these measures can be used
another. in the clinical setting. Previous research by Willson
Interrater reliability of ordinal scale measures failed et al (2008b) using FPPA measures generated by
to reach a level considered adequate for clinical use. hand indicates that the FPPA is a valuable measure
This is in agreement with previous findings of Chmie- of human motion that may be used in the diagnosis
lewski et al (2007). As described above for intrarater and treatment of individuals with lower extremity
reliability, a more thorough and uniform education pathology. Similar findings by Levinger and Gilleard
in the assessment method may improve levels of inter- (2007) are also in agreement.
rater reliability. However, results may also indicate that These results also suggest caution to researchers
this ordinal scale does not lend itself to agreement who may try to apply a computerized method of
between raters due to the scales inherent subjectivity measuring the FPPA with the intention to interpret
and a more refined scale or uniform education may them in the same manner as Willson et al (2007;
not change this. 2008a; 2008b) have in previous publications. It is
Intra- and interrater reliability of ordinal scale not readily apparent why computer-generated
measures in this study was higher than the single pre- measures of the FPPA were different from those gen-
vious study that used a similar specific ordinal scale erated by humans. The hypothesis that they would be
(Chmielewski et al, 2007). Possible explanations for concurrently valid was based on the fact that both the
this include the following: 1) In the study conducted computer and the human raters were “instructed” to
by Chmielewski et al (2007), the raters first scoring measure the FPPA based on the placement of the re-
session was completed by observing subjects in flective markers. It is possible that human error in
person while they performed the functional task, and placing and reading the goniometer or recording the
the second round of observations was completed by FPPA values resulted in significantly different
the raters observing a video of the prior performance. measures than were produced by the computer.
In our study, mode of visualizing the patient perform- However, the high levels of inter-and intrarater
ance was controlled (video only). 2) Chmielewski’s reliability in human measures indicate that human
group did not control for duration between the first error was not likely a substantial source of human
and second round of assessment (10 ± 1.5 weeks), and computer disagreement.
thus introducing the possibility of greater variability Therefore, it is more likely that a difference in the
between raters or a learning or maturation effect; the manner in which the computer and the human
current study controlled precisely for duration measured FPPA existed, causing this difference in
between the first and second round of rater measures. measures. One possible explanation includes the

small aspect difference created by the different place- this artificially improved the ICC values because
ment of the infrared camera and the digital video the within- and between-rater variability might have
camera. This difference in camera placement may been made artificially small relative to the sample
have created enough of a difference in the images pre- variability. It is recommended that future study not
sented to the computer and the human to undermine purposefully create heterogeneity through sample
concurrent validity between the two measurement selection, but rather allow natural variability occur in
methods. The effect could be similar to two humans the pool of subjects being assessed.
standing in two different positions when measuring Results indicate that FPPA measures of movement
the projection angle of any object. A small difference generated by novice clinicians display intra- and inter-
in perspective between the infrared-computer camera rater reliability necessary for clinical application.
and the standard video camera could cause a signifi- However, concurrent validity analysis indicates that
cant difference in measures made from the two differ- what novice clinicians are measuring during FPPA
ent videos generated. analysis may not be the same as the FPPA measures
The current study indicates that the reliability of the generated by 2D computerized motion analysis
methods of movement measurement described shows systems. This is encouraging because it indicates that
levels of reliability near or above that considered ade- even novice clinicians can reliably generate measures
quate for clinical use. However, results also indicate of 2D knee movement. It also highlights a piece of
that quantitative measures of the FPPA derived information not previously known; it is possible that
through inexpensive manual goniometric methods what clinicians measure when asked to assess knee
are not concurrently valid compared to criterion FPPA during a single leg squat may be different
standard measures derived through 2D computerized from measures generated by a high-speed 2D motion
motion-tracking equipment. analysis system. Therefore, until the source of this dis-
agreement is determined and ameliorated, researchers
should proceed with caution when interpreting
LIMITATIONS AND CONCLUSIONS computer-generated FPPA measures using previous

research that relied on handmade measures using
The results of this study do not shed light on the val- standard video equipment.
idity of motion analysis through the ordinal scale. Future investigation should also continue to focus on
Additional study should be focused on use of the modi- the interpretation and clinical validity of FPPA measures
fied ordinal scale and its relationship to musculoskeletal generated by clinicians. Knowing the relationship of
pathology. Although novice clinicians were able to human-generated measures to pathology is important
perform consistent ordinal scale measures across time because the ultimate use of FPPA in the clinical setting
and between raters, the ordinal scale used should be es- will likely be those generated by clinicians and not
tablished as a valid clinical examination tool that pro- expensive computerized motion-tracking systems.
vides information useful for patient management. Translation of this technique into the clinical
Raters in this study performed measures of move- setting has also yet to be investigated. The ability to
ment through viewing video images. Most use of teach a patient the single leg squat and then video
movement assessment in the clinical setting is likely record this data takes approximately 2–3 minutes.
to be through watching patients move in a live situ- With the recent advances in video-capturing devices
ation. Therefore, the implications for reliability and that automatically transfer to computer memory, it is
validity of movement in the clinical setting remain possible that measuring the FPPA in the clinical
unknown. It is possible that viewing video images setting may only take a trained clinician approximately
may result in measures that are identical to, or very 3–5 minutes to capture video data, find the point of
similar to, measures created through watching live peak knee angle during descent, and measure the
action in the clinic. However, no evidence exists in FPPA. Considering the reliability of this measure
the literature for the correlation between measures shown here and the indication from previous study
made through live action and those made through by Willson et al that the FPPA holds valuable infor-
viewing video images. Despite this limitation, using mation for the assessment of those suffering from
video images was the only method to limit the intro- lower extremity pathology, it appears reasonable for
duction of error that might occur due to variation in clinicians to begin exploring the use of this measure-
what was being assessed by the raters. ment method as an additional tool for assessing
Lastly, the pool of videos being assessed was artifi- clients with lower extremity pathology. It is likely
cially composed of an equal number of individuals in that if clinicians begin adding this as a regular
each of the three categories of frontal plane knee angle measurement tool, the understanding of the value of
to create heterogeneity in the sample. It is possible that this technique will be greatly expanded, and previously

594 Poulsen et al.
overlooked coordination deficits in the frontal plane Yu B 2006 Understanding and preventing noncontact anterior
will be more readily treated through neuromuscular cruciate ligament injuries: A review of the Hunt Valley II
meeting, January 2005. American Journal of Sports Medicine
coordination and therapeutic exercise. 34: 1512–1532
Future investigation should also analyze if there is a Hanneman SK 2008 Design, analysis, and interpretation of
significant difference between novice raters and ad- method-comparison studies. AACN Advanced Critical Care
vanced professionals to determine if level of experi- 19: 223–234
ence affects validity of measures made. If this is so, Hewett TE, Myer GD, Ford KR, Heidt RS, Colosimo AJ, McLean
SG, van den Bogert AJ, Paterno MV, Succop P 2005 Biomecha-
it is possible that experience and training play a role nical measures of neuromuscular control and valgus loading of
in validity of clinician measures indicating a need for the knee predict anterior cruciate ligament injury risk in female
a period of training or experience in clinical practice athletes: A prospective study. American Journal of Sports Medi-
before measures of FPPA can be considered valid. cine 33: 492–501
In conclusion, the reliability of novice clinician Kralj A, Jaeger RJ, Munih M 1990 Analysis of standing up and
sitting down in humans: Definitions and normative data presen-
ordinal scale measures of human motion failed to
tation. Journal of Biomechanics 23: 1123–1138

reach levels necessary for clinical application. Concur- Levinger P, Gilleard W 2007 Tibia and rearfoot motion and ground
rent validity assessment of clinician-generated FPPA reaction forces in subjects with patellofemoral pain syndrome
measures, as determined by comparison with compu- during walking. Gait and Posture 25: 2–8
ter-generated FPPA measures, indicates that clinician McKinley PA, Horowitz M 1992 Kinematics of lead and lag legs
during step-downs from two heights. Woollacott, M., Horak,
measures are significantly different from those gener- F. Posture and gait: Control mechanisms, vol. 1, pp 348–
ated by 2D computer motion analysis. However, 351. XIth International Symposium of the Society for Postural
both intrarater and interrater reliability of FPPA and Gait Research
measures successfully exceeded levels necessary for McLean SG, Huang X, van den Bogert AJ 2005 Association
application of this measurement method in the clinical between lower extremity posture at contact and peak knee
valgus moment during sidestepping: Implications for ACL
setting. Future study should be undertaken to injury. Clinical Biomechanics 20: 863–870
determine what relationships exist between novice
Nguyen AD, Shultz SJ 2007 Sex differences in clinical measures of

clinician FPPA measures, experienced clinician lower extremity alignment. Journal of Orthopaedic and Sports
FPPA measures, and musculoskeletal pathology to Physical Therapy 37: 389–398
determine the value of this clinical measure. Norman GR, Streiner DL 1994 Biostatistics: The bare essentials.
St Louis, MO, Mosby-Year Book, Inc
Portney LG, Watkins MP 2009 Foundations of clinical research:
Declaration of interest: The authors report no Applications to practice, 3rd edn. Upper Saddle River, NJ,
conflicts of interest. The authors are responsible for Pearson/Prentice Hall
the content and writing of the article. Salem GJ, Salinas R, Harding FV 2003 Bilateral kinematic and
kinetic analysis of the squat exercise after anterior cruciate liga-
ment reconstruction. Archives of Physical Medicine and Rehabi-
litation 84: 1211–1216
REFERENCES Willson JD, Davis I 2007 Knee frontal plane projection angle during
single leg squats in females with and without patellofemoral pain
Bland JM, Altman DG 1986 Statistical methods for assessing agree- syndrome. American Physical Therapy Association: Combined
ment between two methods of clinical measurement. Lancet 1 Sections Meeting
(8476): 307–310 Willson JD, Davis IS 2008a Utility of the frontal plane projection
Bland JM, Altman DG 2010 Statistical methods for assessing agree- angle in females with patellofemoral pain. Journal of Orthopae-
ment between two methods of clinical measurement. Inter- dic and Sports Physical Therapy 38: 606–615
national Journal of Nursing Studies 47: 931–936 Willson JD, Davis IS 2008b Lower extremity mechanics of females
Chmielewski TL, Hodges MJ, Horodyski M, Bishop MD, Conrad with and without patellofemoral pain across activities with pro-
BP, Tillman SM 2007 Investigation of clinician agreement in gressively greater task demands. Clinical Biomechanics 23:
evaluating movement quality during unilateral lower extremity 203–211
functional tasks: A comparison of 2 rating methods. Journal of Willson JD, Dougherty CP, Ireland ML, Davis IM 2005 Core stab-
Orthopaedic and Sports Physical Therapy 37: 122–129 ility and its relationship to lower extremity function and injury.
DiMattia MA, Livengood AL, Uhl TL, Mattacola CG, Malone TR Journal of the American Academy of Orthopaedic Surgeons
2005 What are the validity of the single-leg-squat test and its 13: 316–325
relationship to hip-abduction strength? Journal of Sport Rehabi- Zeller BL, McCrory JL, Hall JW, Kernozek TW, Shaprio R 2005
litation 14: 108–123 Kinematic and kinetic differences of the hip and knee In
Griffin LY, Albohm MJ, Arendt EA, Bahr R, Beynnon BD, Demaio female and male athletes during two functional manoeuvres.
M, Dick RW, Engebretsen L, Garrett WE, Hannafin JA, British Journal of Sports Medicine 39: 408
Hewett TE, Huston LJ, Ireland ML, Johnson RJ, Lephart S, Zeller BL, McCrory JL, Kibler WB, Uhl TL 2003 Differences in
Mandelbaum BR, Mann BJ, Marks PH, Marshall SW, kinematics and electromyographic activity between men and
Myklebust G, Noyes FR, Powers C, Shields C, Jr, Shultz SJ, women during the single-legged squat. American Journal of
Silvers H, Slauterbeck J, Taylor DC, Teitz CC, Wojtys EM, Sports Medicine 31: 449–456

Concurrent Validity and Reliability of Clinical Evaluation of The Single Leg Squat

Hochgeladen von

Dokumentinformationen

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Concurrent Validity and Reliability of Clinical Evaluation of The Single Leg Squat

Hochgeladen von

Copyright:

Verfügbare Formate

Physiotherapy Theory and Practice, 27(8):586–594, 2011

Copyright © Informa Healthcare USA, Inc.

CLINICAL TECHNICAL NOTE