Beruflich Dokumente
Kultur Dokumente
List of tables ii
List of figures iii
1 Introduction 1
2 Purpose of the Report 1
3 Test Analysis 1
3.1 Descriptive statistics 1-2
i
List of tables
ii
List of figures
iii
1. Introduction
Tests are administered for different reasons but the main one is to
inform decisions. Tests compilers should therefore always strive to
compile tests that are relevant to their objectives.
Statistical analysis of tests and test items makes this possible in the
following way:
The internal consistency and reliability of tests can be checked and
analysed by using the statistical measures of central tendency.
Tests and test items can also be analysed by using the difficulty and
discrimination indices.
3. Test Analysis
Description Values
Number of students who took the test 25
Total score 1645
Mean 65.79
Standard deviation 21.90
Mode 65
Median 65
From the table above we can see that the mean = median = mode =
65. Graphically we can represent this by a normal distribution curve
in this way:
A test has the highest as well as the lowest scores. Table 2 below
shows these scores obtained from our multiple choice question
tests. The difference between the highest and the lowest score is
known as the range. We arbitrarily chose the size of our interval to
be 9 and the interval 10 see table below:
H Value 100.00
L Value 15.00
Range 85.00
No. of Scores 25
No. of Interval 10
Size of Interval 9
3.2.1 Histogram
3.
Figure 2: Histogram
Frequency Histogram
7
6
5
4
3
Frequency
2
1
0
35-44 55-64 65-74 95-104
15-24 25-34 45-54 75-84 85-94
Interval
From the above figure, we can easily see that most of the scores are
above the mean (65). This means that most students did well in this
test. If this information were to be indicated in a curve, we would
have a negatively skewed graph.
4.
3.2.2 Frequency polygon
The frequency polygon is a line graph. The middle values i.e. (the
sum of the lower and higher values divided by two) see table 3
above for middle values, is plotted against the frequency. Figure 2
below is a frequency polygon with two peaks at a frequency of 6 and
middle values of 69.5 and 89.5 respectively.
Frequency Polygon
7
6
69.5 89.5
5
4
49.5
3
59.5
2 99.5
Frequency 29.5
1
19.5 79.5
0
39.5
19.5 29.5 39.5 49.5 59.5 69.5 79.5 89.5 99.5
Middle values
3.2.3 Ogive
5.
Figure 4: Ogive
Cumulative frequency graph (An Ogive)
30
25
20
15
Cumulative 10
5
0
Frequ
ency 24 34 44 54 64 74 84 94 104
Upper Values
K 20
K-1 19
TOTAL (pq) 3.83 6.
STDEV 21.90 The test reliability coefficient in
(STDEV )2 479.57 this case is 1.04, an indication
K20 1.04 of a perfect reliability.
4. Item analysis
Item analysis is a process whereby students’ responses to test items
are examined to assess the quality and accuracy of such test items.
Items that need to be eliminated or improved are exposed during
this process. Decisions on item analysis are informed by the
discrimination and the difficulty indices.
7.
Table 5: Interpretation of the difficulty level of the questions
From the above information one can easily see that only twelve
items have a p-value >0.25 but < 0.75 thus these items are good
and they can be retained, the items are q3, q4, q6, q7, q8, q9, q10,
q13, q17, q18, q19 and q20. Items q1, q2, q11, q12, q14, q15, q16
have a p value > 0.75 and are therefore bad questions.
9.
5. Conclusion
The scores in this test are distributed normally as all the measures
of central tendency are equal. This implies that most of the test
items are meeting the objectives. Eight out of twenty questions viz
(q1, q2, q5, q11, q12, q14, q15 and q14) need improvement or
they should be eliminated as they are too easy, the value of the
difficulty index is very high although these items discriminate
positively (the values of the discrimination indices are positive).
The reliability coefficient of the test is above 1, this implies that the
test is consistent. If the above-mentioned eight questions are
improved, the test will meet the educator’s objectives perfectly.
10.
6. References
11.
7. Appendices
KEY C B D D B C D A
STUDENT
No. Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8
1 C B B A C D A
2 C B D D B D A A
3 C B D D B C D A
4 C B D B B C B A
5 C B D C B C B A
6 C A D D C C A D
7 B B A B B C B B
8 C B D B B C B D
9 C B D A B C D D
10 C B B A B C D C
11 C B D D B C D A
12 C B D D B C D D
13 C B D A B C D A
14 C B D A B C D A
15 C B D D B B A A
16 C B D D B C D A
17 B B C C B A D D
18 C B B D B A D D
19 D C A D B A B A
20 C B D D B C D A
21 C A D D C C A D
22 B B A B B C B B
23 C B D B B C B D
24 C B B A C D A
25 C B D D B D A A
12.
C B A C B D A A C D
Q9 Q10 Q11 Q12 Q13 Q14 Q15 Q16 Q17 Q18
D D A D A A A A C B
C B A C B D A A C D
C B A C B D A A C B
C B A C A D C A C B
C D A C B D A A A B
C D A C A D A A A B
D D A C B D C A A D
B C A C B D A A C A
B D A C B D A A C B
D C A B A D D A C D
C B A C B D A A C D
D A A C A D A A C B
C B A C B D A A A B
C B A C B D A A A
B D A C D A A C B B
C B A C B D A A C D
C A D B D A C
D D A C A D A A C B
D C C D A A D B B B
C A C D B D A A C D
C D A C A D A A A B
D D A C B D C A A D
B C A C B D A A C A
D D A D A A A A C B
C B A C B D A A C D
13.
B C
Q19 Q20
D B
B C
D C
C C
B C
D C
D C
B A
D A
B C
B C
B D
B C
B C
D D
B C
A D
B C
A B
B C
D C
D C
B A
D B
B C
14.
Appendix B: Test Reliability
15.
Appendix C: Number of students in Upper and Lower group