Beruflich Dokumente
Kultur Dokumente
Abstract
Any learner, from any background, in any school will find themselves at some point
faced with the inevitable: The assessment. Once a terrifying prospect, nowadays not
so but always with great benefits in the learning process. Regardless of the degree
of students’ participation in the process, the assessment dominates students'
learning. The time that assessment was a monolithic work of the teacher has long
passed and the assessed is no more passive. (Brown, Bull and Pendlebury, 1997)
They bring in the class an amount of pre-existing knowledge (Klibanoff et al, 2006)
so wide and rich that cannot and should not pass unobserved by any informed
assessment system.
0
EDUC43430: Self Directed Study Konstantinos Nassos
sources are mostly empirical data that include knowledge, experiences and skills
that students bring in class. Assessment data have the following origins (Kuh, G.D.;
Jankowski, N.; Ikenberry, S.O. 2014):
● Examination of students work
● Assessment of the achievements of learning objectives
● Conclusions on the quality of learning from other data
The foci of assessment are as many as the entities involved (Nelson, Robert;
Dawson, Phillip 2014):
In an educational context the term “assessment” did not appear before the end of the
Second World War.
Viewed as a step by step process, assessment in the last twenty years involves:
1
EDUC43430: Self Directed Study Konstantinos Nassos
Types of assessment
The purpose of the initial assessment is to carry out a successful placement of the
student according to prior achievements and individual traits. The placement might
refer to a suitable mentor or an appropriate instructional strategy. Usually,
educational institutions evaluate college readiness and assign students to classes
via placement testing.
As a result, the student will be informed on their skill and the teacher on the specific
aspects requiring further clarifications. As the character of this assessment is
informative no grades are given.
2
EDUC43430: Self Directed Study Konstantinos Nassos
Advanced Placement Exams. (n.d.). In Wikipedia. Retrieved January 15, 2018, from
https://en.m.wikipedia.org/wiki/Advanced_Placement_exams
3
EDUC43430: Self Directed Study Konstantinos Nassos
end of the unit or the course. At that time the teacher will have the full picture of
student’s performance and will design the future learning strategy.
Comparison Basis
Test results can be compared against established criteria or individual’s and other
students’ performance:
4
EDUC43430: Self Directed Study Konstantinos Nassos
Assessment can be either formal or informal. The sources of Formal assessment are
mostly written papers such as documents, tests or quiz. The results of the formal
assessment are delivered in numerical scores heavily impacting the final grade.
Informal assessment is not affecting the final grade. The informal assessment is
applied in a casual way. A wide range of means may be employed: assessments of
5
EDUC43430: Self Directed Study Konstantinos Nassos
Internal and external assessment are characterized by the sources of the test and
the feedback/markings. When the teachers of the school carry out the assessment
and the marking the assessment is internal. When the tests are set by a body
outside the school and the feedback is delivered by another independent authority
then the assessment is external. The Australian external assessment NAPLAN,
places little importance in the feedback given to parents. They only know the
percentile of each type of test the students fall. Schools do not have access to the
questions hence the tests are criticized for merely giving indications and not detailed
performance.
Standards of quality
Reliability
6
EDUC43430: Self Directed Study Konstantinos Nassos
1. Stability: Test performance on a test is on the same scale in more than one
occasions.
2. Form equivalence: Similar content tests result in similar performances.
3. Internal consistency: An assessment is internally reliable when students give
similar answers when asked the same questions (Yu, Chong Ho 2005).
Validity
(Sundström, A. 2008)
Within the educational context, teachers often realize that the tests in effect
sometimes fail to assess the syllabus that were supposed to. The validity of these
tests is questioned.
7
EDUC43430: Self Directed Study Konstantinos Nassos
According to (Baird & Northfield, 1992, p. 21), student-grading and peer and
self-assessment generally refer to “specific judgments of ratings made by pupils
about their achievement, often in relation to teacher-designed categories”
Logistical Teachers do not have to spend their time marking papers when this can
be done by a whole classroom (Boud, 1989). Grading in this way is not only faster
(McLeod, 2001) but also more accurate as it can start right after the test or the
examination. The assessed students may receive more detailed feedback than that
of the teachers as the students who correct papers have the leisure of time which the
teachers in most of the cases cannot afford
(Weaver & Cotrell, 1986)
Metacognitive The concept of metacognition has been defined a few decades ago
as the awareness of one’s own knowledge. It also includes control and manipulation
of the cognitive processes of the individual. The experience of student grading their
own works is invaluable and goes beyond content knowledge
(Meichenbaum, 1985), (Brown, 1987)
8
EDUC43430: Self Directed Study Konstantinos Nassos
The literature
9
EDUC43430: Self Directed Study Konstantinos Nassos
those of their teachers’. Otherwise, the teachers will again have to invest time and
effort in feedback and new grading. (Sadler and Good, 2006)
This alignment is easy to achieve when questions are in the multiple choice format. It
is more taunting though when students need to address open-ended questions.
(Bloom & Krathwohl, 1956).
One of the reasons dictating that students must learn how to assess papers is the
subjective nature of open-ended questions (Boud, 1989; Neukom, 2000).
The acquisition of the subjective judgement has been mandatory and sought out
long time ago. The enigmatic skill of “guild knowledge” could be reached through
long lasting apprenticeship. The recipients of this knowledge were responsible for
transferring it to the new generations. (Sadler, 1989)
A good way for a teacher to specify their marking scheme would be through a rubric.
The rubric or a simple criteria sheet would provide instructions on how to assign
different grades for different levels of achievement.
(Baird & Northfield, 1992; Boud, 1989; Weaver & Cotrell, 1986).
Naturally, defining the thin lines that separate the various grades and especially
determine success from failure lies in the domain of the able teacher (Sadler, 1989)
Of the various forms of the grading process in effect this discussion will be focused
on self and peer grading. Peer grading may involve marking papers by students of
the same or another class where anonymity is observed. In some cases participants
receive extra points when their assessment approach the assessment of the
teachers.
10
EDUC43430: Self Directed Study Konstantinos Nassos
of topics such as humor, grammar and logic students “jumped” to the 62nd percentile
in their self-assessment. The above overestimations occurred in four different
studies. The authors attributed the perceptions of the students to their inability to
differentiate between accurate and wrong. This incapacity is the characteristic of
deficits in metacognitive skills. Interestingly, improvement of metacognitive skills
allows students to identify the boundaries of their abilities
(Kruger and Dunning,1999)
Further literature informs on weak correlation .21 between self-views and teachers’
markings (Hansford and Hattie 1982)
Similarly, college students’ grades are related to these of their students at a .39.
(Falchikov and Boud,1989)
Research paper (Hanrahan & Isaacs, 2001) – referred from now on as (HI, 2001) -
Assessing Self- and Peer-assessment: the students’ views
The participants
The intervention reported in the paper involved a large (N= 233) group of a third year
tertiary school. Students participating in the scheme had no training on self and peer
assessment.
11
EDUC43430: Self Directed Study Konstantinos Nassos
Bachelor of Science, 53
The researcher states that degrees at this university are modular. Hence it is quite
possible that two students meet specifically for a particular subject without sharing
any other subject. In other words there is no common background between peers.
Some of the students may be experts in psychology and some others may
participate in such a project for the first time.
The Subject
A psychology subject assessed in four parts. The students prepare a research essay
of about 1,500 words corresponding to one-fourth of their final semester grade.
Students are given specific dates for submitting self and peer-assessments.
Students' own assignments will be credited on condition of submitting honest, self
and peer-assessments. Additionally, a bonus of 1% would be credited to students
answering questions about their experience of self and peer-assessment.
The results
12
EDUC43430: Self Directed Study Konstantinos Nassos
We will attempt to assess these results by examining the process that produced
them and by looking at other studies on self and peer assessment:
The participants
The participants in this study were teachers and students in the Barbados islands.
Teachers were chosen to represent the full spectrum of achievements of ten high
schools: Starting at the top and finishing at the bottom level measured in national
examinations. Barbados together with 15 more anglophone areas take part in the
examinations of the Caribbean Examinations Council. These examinations are
mandatory for every high school student. The selection of the teachers was based
on the following list of background knowledge and skills:
Professional qualifications, work experience, communication skills, willingness to
remain at the job for the entire academic year, willingness to participate in the
exercise, enthusiasm, innovativeness, availability and the ability to get along well
with students. (McDonald & Boud, 2003)
Teachers’ training was based on two principals. The first was to get familiar with the
practices of the assessment. The second was dedicated to teaching the skill of
transferring this knowledge to other teachers of the same school. With this process
the training started from teachers of the school and ended to the specific teachers of
participant students. The final training of the students included a combination of
self-assessment training with existing curriculum content.
(McDonald & Boud, 2003)
13
EDUC43430: Self Directed Study Konstantinos Nassos
Students participating in the study were also carefully selected. Two groups of
students were formed: The experimental group with 256 participants and the control
group with 259 students. The selection of the students was carried out with criteria
equally detailed as those of the teachers. Students represented top, middle and
bottom level of achievements in the ten chosen high schools. To ensure
transparency in the selection method, the researchers verified students results with
the Caribbean Examinations Council for the years 1998, 1999 and 2000.
The teachers from each of the 10 high schools selected two form 5 classes
(corresponding to grade 11 in US and year 10 in Australia) according to the following
criteria:
A. Every class with maximum possible class size, about 35 students
B. Within each class, every teacher teaches the same subject at the same time
The same conclusions might not be so easily drawn from a sample of participants
coming from different pathways and curricula, such as the students of (HI, 2001)
The Subject
14
EDUC43430: Self Directed Study Konstantinos Nassos
The Results
15
EDUC43430: Self Directed Study Konstantinos Nassos
He also verified that the Office for Standards in Education auditing approved this
practice after enquiring about the criteria in effect. The benefits of self and
peer-assessment are endorsed by (Ofsted, 2014)
16
EDUC43430: Self Directed Study Konstantinos Nassos
At the beginning we discussed about “the heart of teaching and learning” which is no
other than the Assessment. (Black, P. J., 1993) Assessment provides measurable
outcomes for the learner, the opportunities to achieve these outcomes and the
mechanism to continuously inform learning. (Suskie, Linda (2004)
We then moved our focus to self and peer-assessment and the advantages over
teaching grading: Logistical, pedagogical, metacognitive and affective
(Bloom & Krathwohl,1956; Boud, 1989)
Issues of legality pertaining to peer assessment have been raised since the 2001
U.S. Supreme Court Case of Falvo v. Owasso School System and might still show
up with the latest GDPR. (HI, 2001) successfully tended these issues by establishing
mechanisms of anonymity in the methodology of their intervention.
We agree with (HI, 2001) that a sample of 233 students participating in the
intervention is enough to yield reliable results. Especially since these numbers are
corroborated with previous studies from the past: Falchikov (1986) using a group of
48 participants and Stefani (1992, 1994) between 54 and 67 respondents. However,
17
EDUC43430: Self Directed Study Konstantinos Nassos
(McDonald & Boud, 2003) just two years later carry out their intervention with 256
students receiving formal training and another 259 students participating as the
control group.
At the heart of any self or peer-assessment lies a set of skills called metacognitive.
(Meichenbaum, 1985), (Brown, 1987)
Furthermore, teaching metacognitive skills is directly related to the improvement of
students’ learning and these skills can be taught
(Nietfeld & Shraw, 2002; Thiede, Anderson, & Therriault, 2003)
The students are then prone to miscalibrations such as overestimations and other
flawed assessments. Specifically the average correlation between self or
peer-assessment and performance measured by teachers was as weak as .21
(Hansford, Hattie ,1982)
Further cases of overestimation, show that students assign higher marks in 68% of
the time (Falchikov, Boud 1989)
Looking at the sample of (HI, 2001) from a qualitative point of view, we find that all
participants came from the third year of the same tertiary school. Their backgrounds
and prior knowledge were mixed. The only thing the students had in common was
taking the psychology essay of the 1,500 words. The heterogeneity of the sample
reflects on the reliability of the study. If we cannot have similar cohorts of students
18
EDUC43430: Self Directed Study Konstantinos Nassos
then we cannot expect consistency among the results. (Mehrens, W. A., & Lehmann,
I. J. 1987)
On the other hand, the respondents of (McDonald & Boud, 2003) were chosen from
ten high schools with achievements from top, middle and bottom levels. They were
attending mainstream classes since first year of high school and were trained by
their teachers. We believe that such sample is in a position to offer targeted and
coherent views on the experience of self and peer-assessment. Consistency can be
measured between similar groups and the reliability can be evaluated.
The topic of (HI, 2001) is an open ended subject where students offer unprompted
views with no criteria set. The discussion of (HI, 2001) clarifies that the study is not
interested in identifying similar comments between themes and dimensions. As we
are deprived of possible criteria for validity, we are unsure as to whether the results
of this study can be verified.
(McDonald & Boud, 2003) are working with external examinations receiving specific
grades on a number of subjects. Except for the group receiving the training there is
another group, the control group. Consequently, there is a two fold validity of
students’ markings. Between the trained and the control group of the students and
between any of the students’ group and the teachers. As the trained group
outperformed the control group in all curriculum areas, we feel that the study
measured what it was intended to measure. Hence validity is achieved.
The findings from the Comprehensive School of Ashington, see appendix, report a
clear case of improvement in year 11 students’ performance when training in self
and peer-assessment is delivered. Mr B, teacher of Mathematics, started delivered
training at some classes of year 9. By the time the students reached year 11, Mr B
had ample proof that students of the trained classes performed better than the
19
EDUC43430: Self Directed Study Konstantinos Nassos
students of the other classes. The above findings align with literature such as
(Sadler, 1989) and their concept of “guild knowledge”.
Informal discussions with Greek year 12 Maths students, preparing for University
entrance examinations, revealed an interesting view on the effects of self and
peer-assessments. The students who received training at the beginning they did not
like the idea and they felt like trespassing their peers’ privacy. In the process they
came to identify and appreciate the feeling of collaboration. They clarified that the
real focus is the entrance to a University and peers and teachers are on their side
supporting their efforts.
Interestingly, when there was no training prior to the assessment, the students
expressed feelings of disbelief towards this new practice of their teachers. There
were roughly two dimensions in their views: They either felt that this was just a new
experiment from the Ministry of Education or their teachers were not conscientious
enough to train them. They unanimously stated that they assigned high marks to all
their assessments and hoped teachers and administrators received the message.
Looking at the above, we believe that self and peer assessment yield the
advantages we mentioned when students’ assessment resemble these of their
teachers. (Sadler and Good, 2006) On the other hand, we are aware that consistent
grades amongst different raters does not necessarily imply perfect fairness
(Marculides and Simkin, 1991)
20
EDUC43430: Self Directed Study Konstantinos Nassos
We believe that the contemporary “guild” (Sadler 1989), lies in the area of
Metacognitive skills (Meichenbaum, 1985), (Sadler 1998), (Boud 2000)
Indeed, these skills are rightly deemed to be the requirement for lifelong learning
(Nietfeld & Shraw, 2002; Thiede, Anderson, & Therriault, 2003)
21
EDUC43430: Self Directed Study Konstantinos Nassos
References
Alexander, P. A., Schallert, D. I., & Hare, V. C. (1991). Coming to terms. How
researchers in learning and literacy talk about knowledge. Review of Educational
Research, 61, 315–343.
Baird, J. R., & Northfield, J. R. (1992). Learning from the PEEL experience.
Melbourne, Australia: Monash University
Black, Paul, & William, Dylan (October 1998). "Inside the Black Box: Raising
Standards Through Classroom Assessment."Phi Beta Kappan. Available at
PDKintl.org. Retrieved January 28, 2009
Black, P., & Harrison, C. (2001). Self- and peer-assessment and taking
responsibility, the science student’s role in formative assessment. School Science
Review, 83, 43–48.
Black, P., & Atkin, J. M. (1996). Changing the subject. Innovations in science, math,
and technology education. London: Routledge.
22
EDUC43430: Self Directed Study Konstantinos Nassos
Cohen Louis, Manion Lawrence and Morrison Keith, 2004 Published on the
companion web resource for A Guide to Teaching Practice (RoutledgeFalmer).
Dunning, D., Heath, C., Suls, J.M (2004). Flawed Self-Assessment Implications for
Health, Education, and the Workplace, Psychological Science in the Public Interest,
Supplement, December 2004, Vol.5(3), pp.69-106
23
EDUC43430: Self Directed Study Konstantinos Nassos
ERIC ED GOV
http://eric.ed.gov/?id=ED041829
Falchikov N., Magin, D. (1997). Detecting Gender Bias in Peer Marking of Students’
Group Process Work, Assessment & Evaluation in Higher Education, 01 December
1997, Vol.22(4), p.385-396
24
EDUC43430: Self Directed Study Konstantinos Nassos
Kruger, J., & Dunning, D. (1999). Unskilled and unaware of it: How difficulties in
recognizing one's own incompetence lead to inflated self-assessments. Journal of
Personality and Social Psychology, 77(6), 1121-1134
25
EDUC43430: Self Directed Study Konstantinos Nassos
Kuh, G.D.; Jankowski, N.; Ikenberry, S.O. (2014). Knowing What Students Know and
Can Do: The Current State of Learning Outcomes Assessment in U.S. Colleges and
Universities (PDF). Urbana: University of Illinois and Indiana University, National
Institute for Learning Outcomes Assessment.
Louis Cohen, Lawrence Manion and Keith Morrison, 2004 Published on the
companion web resource for A Guide to Teaching Practice (RoutledgeFalmer).
Marcoulides, G. A., & Simkin, M. G. (1991). Evaluating student papers: the case for
peer review, Journal of Education for Business, 67, November/December, 80-83.
Mctighe, Jay; O'Connor, Ken (November 2005). "Seven practices for effective
learning". Educational Leadership. 63 (3): 10–17. Retrieved 3 March 2017
Mehrens, W. A., & Lehmann, I. J. (1987). Using standardized tests in education (4th
ed.). New York, NY, US: Longman/Addison Wesley Longman.
Meichenbaum, D., Burland, S., Gruson, L., & Cameron, R. (1985). Metacognitive
assessment. The growth of reflection in children, 3-30.
26
EDUC43430: Self Directed Study Konstantinos Nassos
Moskal, Barbara M., & Leydens, Jon A (2000). "Scoring Rubric Development:
Validity and Reliability." Practical Assessment, Research & Evaluation, 7(10).
Retrieved January 30, 2009
(Ncetm.org.uk, 2018)
Nelson, Robert; Dawson, Phillip (2014). "A contribution to the history of assessment:
how a conversation simulator redeems Socratic method". Assessment & Evaluation
in Higher Education. 39 (2): 195–204.
Newstead, S., & Dennis, I. (1990). Blind marking and sex bias in student
assessment. Assessment and Evaluation in Higher Education, 15, 132-139.
Newstead, S., & Dennis, I. (1994). Examiners examined: the reliability of exam
marking in psychology. The Psychologist, 7(5), 216-219.
27
EDUC43430: Self Directed Study Konstantinos Nassos
Nietfeld, J. L., & Shraw, G. (2002). The effect of knowledge and strategy explanation
on monitoring accuracy. Journal of Educational Research, 95(2), 131–142
Ofsted, (2014). Teaching, learning and assessment in further education and skills –
what works and why
www.ofsted.gov.uk/resources/140138
Pfeifer, J. K. (1981). The effects of peer evaluation and personality on writing anxiety
and writing performance in college freshmen. Unpublished master’s thesis, Texas
Tech University, Lubbock, TX.
Sadler, Philip M., and Eddie Good "The Impact of Self- and Peer-Grading on Student
Learning." Educational Assessment 11.1 (2006): 1–31
28
EDUC43430: Self Directed Study Konstantinos Nassos
University of Illinois
https://education.illinois.edu/circe/Robert_Stake.html
29
EDUC43430: Self Directed Study Konstantinos Nassos
Wu, Margaret (2015). "What National Testing Data Can Tell Us". In Lingard, Bob;
Thompson, Greg; Sellar, Sam. National Testing in Schools: An Australian
Assessment. Routledge. pp. 19–23, 27
Zoller, U., Tsaparlis, G., Fastow, M., & Lubezky, A. (1997). Student self-assessment
of higher-order cognitive skills in college science teaching. Journal of College
Science Teaching, 27, 99–101
30