Validity &
Reliability
Presented By:
Sunil Kumar Saroha
Reliability of Test
The reliability of a test refers to the stability of
measurement over time. When a person's data
entry skills are measured on two occasions
(with no special training in between), the two
sets of scores should be similar. Reliability is
often measured with a reliability coefficient,
which is simply a correlation between sets of
scores from people who have been given the
test on two occasions (X = first-time score on
the test, Y = second-time score on the test).
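The reliability coefficient described above is just the Pearson correlation between the two sets of scores. A minimal sketch, using hypothetical data-entry scores for five people tested twice (the names and numbers here are invented for illustration):

```python
import math

def reliability_coefficient(first_scores, second_scores):
    """Pearson correlation between two administrations of the same test."""
    n = len(first_scores)
    mean_x = sum(first_scores) / n
    mean_y = sum(second_scores) / n
    # Covariance numerator and the two variance terms of Pearson's r
    cov = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(first_scores, second_scores))
    var_x = sum((x - mean_x) ** 2 for x in first_scores)
    var_y = sum((y - mean_y) ** 2 for y in second_scores)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical scores: X = first administration, Y = second administration
first = [52, 61, 45, 70, 58]
second = [50, 63, 47, 68, 60]
r = reliability_coefficient(first, second)
```

For data like this, where each person scores about the same on both occasions, `r` comes out close to +1, which is what a reliable test should show.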
Test-Retest
Test-retest - the same test is
given to the same people on
two occasions. The scores are
correlated, and if the reliability
coefficient is positive and
high, the test is reliable.
Example of Test-Retest
Split-Half
Split-half - after the test is taken by a
sample, the answers are
divided into two halves (e.g., the
odd-numbered versus the even-numbered
items). Scores on each
half are correlated. If the test is
reliable, the scores on the two
halves should be highly correlated.
Example of Split-Half
Split-half - each person has two scores, one on each half of the
exam.

Student ID | Score on even-numbered items (30 possible) | Score on odd-numbered items (30 possible)
1          | 23                                         | 24
2          | 30                                         | 29
3          | 26                                         | 25
4          | 19                                         | 20
5          | 24                                         | 23
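The split-half reliability for a table like the one above is the correlation between the two columns of half-test scores. A short sketch using the five students' scores from the example:

```python
import math

# Scores from the split-half example table (five students)
even_half = [23, 30, 26, 19, 24]  # even-numbered items, 30 possible
odd_half = [24, 29, 25, 20, 23]   # odd-numbered items, 30 possible

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

r = pearson_r(even_half, odd_half)
```

Each student's two half-scores differ by at most one point, so `r` is very close to +1 here, indicating a reliable test.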
Alternate Forms
Alternate forms- two versions of the
test are constructed, and given to
the same people on two occasions.
Scores on the two forms should show
a high positive reliability coefficient
(correlation).
(Example table: Student IDs 1-5; the scores on each form were not recovered from the source.)
Testing Validity
Construct validity
Content validity
Content validity - sometimes known as "face
validity." On the face of it, does the test or
inventory look like it measures what it is
supposed to measure? Does the content make
sense? An item asking about how gears work
would appear to measure mechanical ability.
Asking someone to indicate on a checklist
whether they are feeling happy or sad sounds
like a good way to find out about that person's mood.
Criterion validity
Concurrent Validity
Predictive Validity