Beruflich Dokumente
Kultur Dokumente
.
Measurements were taken according to identiable anatomic
landmarks, avoiding the estimation of each exact joint rotational
axis. The ability to zoom in on the electronic image made identi-
cation of anatomic landmarks very easy.
2.4. Statistical analysis
The distribution of each variable was examined using the
ShapiroeWilk normality tests and homogeneity of variance was veri-
ed with a Levene test. Data are presented as either means with 90%
condence limits (90% CL) (Hopkin, 2009). The reliability of the overall
AFSE method was rst assessed while analysing the standardized
differences (Cohens d) between the angles measured during the two
testing sessions. If Cohens d < 0.2 and the 90% condence interval
included zero, we considered that there was no substantial between-
day difference. The spreadsheet of Hopkins (2010) was then used to
determine the intraclass correlation coefcient (ICC) and the typical
error of measurement (TE, s or cm), expressed as a coefcient of vari-
ation (CV, %). While a Bland & Altman test is often used in reliability
studies (Nussbaumer et al., 2010; Peeler &Anderson, 2008), it was not
used in the present study given its limitations to examine within-
subjects variations (Hopkins, 2000; Hopkins, Marshall, Batterham, &
Hanin, 2009). Additionally, it is important to acknowledge that
havingthebest reliabilitydoes not meanameasureis themost useful
at monitoring something valuable, as a number of physiological
measures have high reliability but may not be sensitive measurement
tools (Hopkins, 2000). Therefore, CV values were not interpreted as
good, moderate or poor as regularly proposed. Possible reliability
differences between operators, analysers and the different operators/
analysers combinations wereassessedbycomparingtheaverageof the
8 coefcients of variation (see below) obtained by each operators,
analysers and operators/analysers combinations for each of the 8
muscle groups. The magnitude of difference between the two
consecutive testing sessions or between the CV obtained for the
different analysers, operators and operators/analysers combinations
was expressed as standardized mean differences (Cohens d). Criteria
used to interpret the magnitude of the Cohens d were: <0.2 trivial,
0.2e0.5 small, 0.5e0.8 moderate, >0.8 large (Hopkins & Hewson,
2001). Additionally, the chances that the angles or CV values were
greater (i.e., greater thanthesmallest practicallyimportant effect, or the
smallest worthwhile change (SWC) [0.2 multiplied by the between-
subject standard deviation, based on Cohens d principle]), similar or
smaller than the other day (angles measures) or the other analysers,
operators and operators/analysers combinations (between-staff reli-
ability comparisons) were calculated (Cohen, 1988). Quantitative
chances of substantial differences were assessed qualitatively as
follows: <1%, almost certainly not; 1e5%, very unlikely; 5e25%, unlikely;
25e75%, possible; 75e95%, likely; 95e99, very likely; >99%, almost
certain. If the chance of having greater or lower values were both >5%,
the true difference was assessed as unclear (Hopkins et al., 2009).
3. Results
Ideally, 2880 angles could be analysed, but 425 pairs of data
were lost between day 1 and day 2 (e.g., poor quality of some video
clips or missing les as when operators did not perform the test for
Table 1
Measures of reliability for each lower limb muscle group.
n Hip exors Hamstring Soleus Gastrocnemius Adductors Quadriceps Hip medial rotators Hip lateral rotators
128 128 125 128 128 124 126 128
Endpoint
angle (
)
157 (6) 160 (11) 59 (10) 74 (7) 53 (9) 48 (11) 53 (15) 45 (14)
CV (90% CL) 2.6 (2.3; 2.9) 3.3 (3.0; 3.7) 4.5 (4.0; 5.0) 5.7 (5.1; 6.3) 7.2 (6.5; 8.0) 8.3 (7.5; 9.3) 9.6 (8.6; 10.8) 12.4 (12.2; 14.0)
Difference
(Cohens d)
(90% CL)
0.01 (0.2; 0.2) 0.06 (0.3; 0.1) 0.01 (0.2; 0.2) 0.10 (0.1; 0.3) 0.05 (0.2; 0.3) 0.05 (0.3; 0.2) 0.04 (0.2; 0.2) 0.13 (0.3; 0.1)
ICC (90% CL) 0.51 (0.39; 0.61) 0.80 (0.74; 0.85) 0.93 (0.90; 0.95) 0.66 (0.57; 0.73) 0.85 (0.81; 0.89) 0.86 (0.82; 0.89) 0.92 (0.89; 0.94) 0.91 (0.88; 0.93)
Mean (SD) endpoint angle, typical error of measurement expressed as a coefcient of variation (CV, %; 90% CL), standardized between-day differences (Cohens d; 90% CL) and
intraclass correlation coefcient (ICC) for each lower limb muscle group.
F. Fourchet et al. / Physical Therapy in Sport 14 (2013) 28e34 31
one of the eight muscle groups). The entire ow-chart of partici-
pants including drop outs is represented in Fig. 1
3.1. Measure of reliability for each lower limb muscle group
Depending on the muscle group considered, we observed CV
values ranging from2.6 to12.4%(Table 1 andFig. 3). Table 2 presents
magnitude-based differences in CV between the different muscle
groups: the lower CV was observed for hip exors, while the worse
(i.e., the greater value) was noted for hip lateral rotators (Fig. 3).
Between-muscle groups comparisons were substantial for all paired
comparisons (all Cohens d rated as large and differences at least as
very likely). The associated ICC values are presented in Table 1.
3.2. Impact of operators and/or analysers on the reliability of
measurements
Fig. 4 shows that there was no substantial difference in CV,
neither between the operators, nor between-analysers (all Cohens
d rated as trivial and differences as unclear). As shown in Table 3,
there was also no substantial difference between all the possible
operators/analysers combinations (all Cohens d rated as trivial and
differences as unclear).
4. Discussion
The purpose of this study was to evaluate the reliability level of an
innovative method to assess the exibility of lower limbs muscle
groups in highly-trained adolescent athletes, i.e., the Angle at Force
Standardized Endpoint (AFSE). The second aim of the study was to
examine whether interchanging the operators and/or the video ana-
lysers was likely to affect the reliability of the measures. The results
revealed that the coefcient of variation for the between-muscle
groups comparisons ranged from 2.6 to 12.4%, depending on the
muscle group considered and there was no substantial difference
between all the possible operators/analysers combinations.
4.1. Reliability assessment
Most previous reliability studies used the intraclass correlation
coefcient (ICC) as a measure of reliability (Berryman Reese,
2002; Bolhin, Sandstrom, Angstrom, & Lindstrom, 2005).
T
a
b
l
e
2
M
a
g
n
i
t
u
d
e
-
b
a
s
e
d
i
n
f
e
r
e
n
c
e
s
f
o
r
m
e
a
n
d
i
f
f
e
r
e
n
c
e
i
n
C
V
b
e
t
w
e
e
n
t
h
e
d
i
f
f
e
r
e
n
t
m
u
s
c
l
e
g
r
o
u
p
s
.
G
a
s
t
r
o
H
a
m
s
t
r
i
n
g
H
i
p
e
x
o
r
s
H
i
p
l
a
t
e
r
a
l
r
o
t
a
t
o
r
s
H
i
p
m
e
d
i
a
l
r
o
t
a
t
o
r
s
Q
u
a
d
r
i
c
e
p
s
S
o
l
e
u
s
A
d
d
u
c
t
o
r
s
1
.
3
(
0
.
3
;
2
.
3
)
9
7
/
2
/
1
4
.
6
(
3
.
7
;
5
.
6
)
1
0
0
/
0
/
0
5
.
7
(
4
.
7
;
6
.
7
)
1
0
0
/
0
/
0
1
.
7
(
2
.
7
;
0
.
8
)
0
/
1
/
9
9
1
.
7
(
2
.
6
;
0
.
7
)
0
/
1
/
9
9
1
.
1
(
2
.
1
;
0
.
2
)
2
/
4
/
9
4
2
.
8
(
1
.
8
;
3
.
8
)
1
0
0
/
0
/
0
G
a
s
t
r
o
c
n
e
m
i
u
s
e
2
.
5
(
1
.
6
;
3
.
5
)
1
0
0
/
0
/
0
3
.
7
(
2
.
7
;
4
.
7
)
1
0
0
/
0
/
0
2
.
5
(
3
.
5
;
1
.
5
)
0
/
0
/
1
0
0
2
.
7
(
3
.
7
;
1
.
7
)
0
/
0
/
1
0
0
2
.
3
(
3
.
3
;
1
.
3
)
0
/
0
/
1
0
0
1
.
1
(
0
.
1
;
2
.
0
)
9
3
/
5
/
2
H
a
m
s
t
r
i
n
g
e
e
1
.
6
(
0
.
6
;
2
.
6
)
9
9
/
1
/
0
4
.
4
(
5
.
4
;
3
.
4
)
0
/
0
/
1
0
0
6
.
2
(
7
.
2
;
5
.
2
)
0
/
0
/
1
0
0
6
.
5
(
7
.
4
;
5
.
5
)
0
/
0
/
1
0
0
1
.
8
(
2
.
8
;
0
.
8
)
0
/
1
/
9
9
H
i
p
F
l
e
x
o
r
s
e
e
e
5
.
3
(
6
.
2
;
4
.
3
)
0
/
0
/
1
0
0
7
.
1
(
8
.
1
;
6
.
2
)
0
/
0
/
1
0
0
7
.
3
(
8
.
3
;
6
.
4
)
0
/
0
/
1
0
0
3
.
1
(
4
.
1
;
2
.
2
)
0
/
0
/
1
0
0
H
i
p
L
a
t
e
r
a
l
R
o
t
a
t
o
r
s
e
e
e
e
0
.
7
(
0
.
3
;
1
.
7
)
8
1
/
1
3
/
7
1
.
1
(
0
.
2
;
2
.
1
)
9
4
/
4
/
2
3
.
4
(
2
.
4
;
4
.
4
)
1
0
0
/
0
/
0
H
i
p
M
e
d
i
a
l
R
o
t
a
t
o
r
s
e
e
e
e
e
0
.
7
(
0
.
2
;
1
.
7
)
8
4
/
1
1
/
5
4
.
4
(
3
.
4
;
5
.
4
)
1
0
0
/
0
/
0
Q
u
a
d
r
i
c
e
p
s
e
e
e
e
e
e
4
.
3
(
3
.
3
;
5
.
3
)
1
0
0
/
0
/
0
V
a
l
u
e
s
r
e
p
r
e
s
e
n
t
m
e
a
n
s
t
a
n
d
a
r
d
i
z
e
d
d
i
f
f
e
r
e
n
c
e
(
C
o
h
e
n
s
d
;
9
0
%
C
L
)
a
n
d
p
e
r
c
e
n
t
a
g
e
o
f
c
h
a
n
c
e
o
f
h
a
v
i
n
g
g
r
e
a
t
e
r
/
s
i
m
i
l
a
r
/
l
o
w
e
r
C
V
v
a
l
u
e
f
o
r
t
h
e
m
u
s
c
l
e
g
r
o
u
p
i
n
t
h
e
l
e
f
t
c
o
l
u
m
n
c
o
m
p
a
r
e
d
w
i
t
h
t
h
e
o
n
e
o
n
t
h
e
h
e
a
d
i
n
g
r
o
w
.
H
i
p
F
l
e
x
o
r
s
H
i
p
M
e
d
r
o
t
.
H
i
p
L
a
t
r
o
t
.
H
a
m
s
t
r
i
n
g
Q
u
a
d
r
i
c
e
p
s
A
d
d
u
c
t
o
r
s
S
o
l
e
u
s
G
a
s
t
r
o
(
%
)
f
r
o
m
t
h
e
m
e
a
n
C
V
-80
-60
-40
-20
0
20
40
60
80
100
Fig. 3. Measures of reliability for each lower limb muscle group (i.e., coefcient of
variation, CV) expressed as a difference (D, %) from the mean CV for all muscle groups.
Abbreviations: Hip Med rot, Hip medial rotators; Hip Lat rot, Hip lateral rotators;
Gastro, Gastrocnemius.
F. Fourchet et al. / Physical Therapy in Sport 14 (2013) 28e34 32
Nevertheless, the ICC is sample size dependent and largely
affected by the heterogeneity of the between-subject measures
(Weir, 2005). Moreover ICC does not provide an index of the
expected trial-to-trial noise in the data, but rather reects the
ability of a test to differentiate between individuals (i.e., relative
reliability, (Weir, 2005)). Since in the present study, the main
objective was to examine the reliability of intra-subjects
measures, ICC values were not relevant. The ICC obtained with
AFSE are therefore provided for the readers information in
Table 1, but will not be discussed in the following paragraphs.
Hopkins (2000) therefore proposed to use the typical error of
measurement as the most appropriate measure of absolute reli-
ability in the applied eld setting. Typical error of measurement
represents the noise occurring from trial-to-trial, which might
confound the assessment of real changes in repeated measures
(i.e., when monitoring changes in athletes).
4.2. Reliability of measurement for each lower limb muscle group
When angle measures were repeated over different days with
AFSE, we observed CV values ranging from 2.6 to 12.4%. While
there were substantial differences in reliability between the
different muscle groups considered (Table 1 and Fig. 3), most of
the CV values were within the ranges previously reported in
the literature for similar muscle groups but using different
methods. For example, in the present study, the CV for hamstring
was 3.3%, which was consistent with the data of Fredriksen
et al. (1997). Involving only 2 operators and 2 subjects, these
workers reported CV ranging from 0.8% to 3.2% (Fredriksen et al.,
1997). In another experiment, reliability of an isokinetic-assisted
hamstrings stretching protocol yielded a CV of 5.8e6.5%
(Magnusson et al., 1997). When Peeler and Anderson (2008)
recently explored the exibility of the rectus femoris with
a modied Thomas test, they reported a CV value of 13%. In our
study, the CV for the quadriceps was 8.3%. More recently, the
reliability of several hip ranges of motion measures was assessed
with an electromagnetic tracking system (Nussbaumer et al.,
2010). We reported similar CV values as these authors (i.e., 7.2%
for abduction and 9.6% for medial rotation in our study vs. 5.6%
and 10.2%, respectively in their study).
As mentioned earlier, it is worth noting that the reliability of the
measures was muscle group-dependent: the CVs for the hip rotators
were 9.6% and 12.4%, while that for the hip exors was only 2.6%. We
postulate that these differences might be related to variability in
standardisation for some measures. With regards to hip rotators
measures, both techniques (hip at 0
or hip at 90
in exion as we did)
canbeconsideredas equivalent interms of reliability(BerrymanReese,
2002). This question remains however controversial, as Benell et al.
(1999) or Van Dillen (2008) suggested that positioning the hip at
0