Beruflich Dokumente
Kultur Dokumente
Reliability
From the perspective of classical test
theory, an examinee's obtained test
score (X) is composed of two
components, a true score component
(T) and an error component (E):
X=T+E
Reliability
The true score component reflects the
examinee's status with regard to the
attribute that is measured by the test,
while the error component represents
measurement error.
Consistency = Reliability
The Reliability Coefficient
Test Length
Range of Test Scores
Guessing
1. Test Length:
The larger the sample of the attribute being
measured by a test, the less the relative
effects of measurement error and the more
likely the sample will provide dependable,
consistent information.
Where:
SEmeas = standard error of measurement
SDx = standard deviation of test scores
rxx= reliability coefficient
Interpretation of Standard
Error of Measurement
As shown by the formula, the magnitude of
the standard error is affected by two factors:
T’=a + bX
=(1-rxx )X + rxx X
T’=(1-.84) x 70 + .84 x 80
=.16 x 70 + .84 x 80
=11.2 + 67=78.2
The Reliability of Difference
Scores
A test user is sometimes interested in
comparing the performance of an
examinee on two different tests or
subtests and, therefore, computes a
difference score. An educational
psychologist, for instance, might
calculate the difference between a
child's WISC-IV Verbal and
Performance scores.
The Reliability of Difference
Scores
When doing so, it is important to keep in mind that the
reliability coefficient for the difference scores can be no
larger than the average of the reliabilities of the two
tests or subtests:
A1 rA1A1 (.93)
A1 rA1A1 (.93)
OR