Beruflich Dokumente
Kultur Dokumente
Page 1
Iames I. Brosamer
Department of English University of Alaska Anchorage Written assignments in large classes can be made practical by the use of holistic scoring. Data are Presented to show the reliability, validity, and efficiency of adapting holistic scoring to written work in an introductory psychology course in which the grading was
done entirely by a teaching assistant (TA). Separate criteria were
used to score the content and writing skill of paragraph/length essaytest answers and shortpaper assignments. judgments of writing skill made by Psychology TAs using holistic criteria were reliable and compared favorably with writingskill judgments made by experienced English composition instructors We con~ clude that holistic scoring is a highquality, l0w~cost approach to scoring written work in large classes.
Many college faculty agree that writing assignments are valuable additions to college classes. For Boyer (1987), the
ability to communicate well in writing and speaking is one of the hallmarks of an educated person. He listed the develop' ment of writing skills among the major objectives of a college education. The growth and influence of the writing~across'
the~curriculum movement is well attested (Griffin, 1982; Maimon, 1982; McLeod, 1988). Its relevance to psychology courses was explored in a special issue of this journal (Nodine, 1990). In addition to developing writing skills, there are other reasons to include written work in courses. Writing is a special mode of learning (Emig, 1977); it can improve critical thinking (Fulwiler, 1982) by encouraging writers to scrutinize the language expressing their thoughts. Written examinations may produce better mastery of course material than do multiplechoice tests (Frederiksen, 1984). The cost of written work is that it must be read and graded. The prospect of the grading task, coupled with the
Holistic grading may offer a way to address issues of work load and quality in evaluating written work. In holistic scor~
1985). Each category is described by three or four sentences that identify its salient characteristics. This article describes our adaptation of holistic scoring to
written work in a medium'sized introductory psychology
class in which all grading was done by teaching assistants (TAs). This situation required an efficient system that could easily be taught to new TAs. But a useful scoring system must also provide students with reliable and valid judgments of
their written work. lntergrader reliability was a concern in
In the following sections, we describe a holistic scoring system developed for this course and present data on its
efficiency, reliability, and validity. The grading system was
created for a General Psychology course taught in a lecture format to about 100 students. During the semester, a typical student wrote 4,000 words on essay exams and paper assignments. The primary grading was done by a TA and super~ vised by the course instructor.
The Scoring Approach The holistic scoring methods developed for the course
evolved from examples presented by White (1985) but differ from his in order to meet the pedagogical objectives of Gen eral Psychology. The course instructor intended the writing component of the course to foster content mastery and to support the development of students writing skills, whereas White was concerned only with the assessment of writing skill. New holistic scoring categories were developed to al~ low each writing sample to be given two scores, one for content and another for quality of expression. This distinc tion between content and writing skill tells the students that the ability to write well is important, and it provides them with direct feedback about the quality of their written work. The relative magnitude of the content and writing~skill
91
Page 2
scores was determined by the instructor's sense of their rela' tive pedagogical importance for the course. Content was weighted 80%, and writing skill was weighted 20%.
preferences. The TA assigned a score from the range to indicate the quality of the students work within the category.
The course generated two different types of written work: (a) responses to short~answer essay'test questions and (b) short papers. Scoring criteria were developed for each.
Weekly tests consisted of eight multiple'choice and two essay questions. Ten essay questions were distributed in ad vance on a weekly study guide; two of tne questions were selected for the weekly test. Students also received handouts describing the grading system and suggesting ways of doing well on the tests. Students were encouraged to prepare for
20 The answer addresses the question completely. It is concise and may contain original observations or examples. There is no irrelevant material.
Writing-Skill Scoreb Criteria 0 The answer is clearly inappropriate to the question. i The answer is so lacking in coherence and unity that it is very difcult to follow, or the answer is so brief that it says almost nothing.
2 The answer does not develop in a manner appropriate to the question; in addition, the material is not presented in a unified, coherent paragraph. 3 Either the answer does not develop in a manner appropriate to the question or the answer is a poorly constructed paragraph. 4 The answer is a unified, coherent paragraph that addresses the question appropriately. lt adequately de velops and expresses the writers thoughts.
5 The answer is a well developed, unified, and coherent paragraph that skillfully addresses the question. The writing is more than adequate; it is good.
92
the content otthe answer and up to 5 points tor writing skill. The grader read the student's work and assigned points for content and writing skill according to the categories given in Table l.
Short-Paper Scoring
Students could also earn course points by writing optional homework assignments. Qnly the most commonly selected
044 The paper does not come to terms with the assign-
4549 The paper ignores several content areas or shows a misunderstanding of the principles involved.
5054 The paper ignores more than one content area of the assignment but shows the writers ability to appreci-
ate key ideas and principles. 5559 The paper slights or ignores one of the content areas of the assignment. The writer demonstrates a clear understanding of the assignment and the psychological principles involved.
60 The paper deals fully with all content areas specified
as the full credit paper, but it is organized into unified, coherent sections and is largely free from serious errors in mechanics, usage, and sentence structure.
15 The paper shows careful organization and orderly
thinking. The transitions between content areas are smooth; the paper may organize the content areas
creatively to improve the readability of the paper;
and the author may make thoughtful comments. it is virtually free from errors in mechanics, usage, and
sentence structure. aContent requirements included an appropriate introduction, de-
Teaching of Psychology
Page 3
papers. Evidence supporting the efficacy of instructor comments to improve student writing comes from studies in which students revised earlier drafts based on instructor
feedback (Willingham, 1990; Ziv, 1984). This was not the case here. Furthermore, handwritten comments are exceed' ingly time'consuming. Students did receive a model answer
when their tests were handed back, and they were encour'
aged to discuss tests and papers with the TA.
Scoring Reliability Method Reliability was assessed by comparing the grades assigned by a firstayear graduate student and an undergraduate psy~
chology senior, inexperienced as a TA. The graduate stu~
dent had served as the TA for the course during two se~ mesters. The TA spent about 4 hr training the psychology
senior in the scoring methods. Two essayatest answers writ, ten in a previous class were available for 30 students. The
answers were typed on separate sheets of paper and scored by the two assistants using the criteria in Table 1.
To assess the reliability of Table 2's scoring criteria for
short papers, a set of 23 papers graded during the semester by the TA were also graded by the psychology senior. All papers were typed.
Results The reliabilities for content scores given to essay~test an~ swers were .91 and .95 for the two questions analyzed. The writing/skill scores for these questions had reliabilities of .61
and .75. The shortpaper scores yielded a content reliability of .68 and a writing'skill reliability of .71. Scoring Validity Method
The validity analysis focused on whether the writing'skill
scores given by the TAs were consistent with judgments of writing skill made by better trained evaluators of written work. Two experienced instructors of freshman composition read the same essay~test answers and short papers that had been previously scored by the TAs. The English instructors worked independently of each other and judged the essays separately from the papers. They were told to assign each writing sample to one of five categories based on its quality. Writing quality was defined as a subjective term that gen~ erally refers to a writers ability to express thoughts in a clear,
organized, and interesting manner. No other constraint
samples into five writingaquality levels, their interscorer re~ liability was .68 for the test essays and .65 for the short
papers. The average of the TAs writing'skill scores were correlated with the corresponding judgments of the English
instructors and yielded .73 for the essay answers and .48 for the short papers. Scoring Efficiency Method and Results
The third area of interest was the efficiency of the scoring
system. Time required to score each essay~test answer was
has been carefully refined and is administered by experi~ enced, trained judges.
The writingskill judgments of the essay~test answers made by psychology TAs had impressive validity. We con'
clude that the psychology TAs are able to use holistic scoring criteria to make writing~skill judgments about paragraph' length test answers that are comparable to the judgments of professional raters who are better trained and more
experienced. However, the agreement between the TAs and the reach ers of freshman composition was not as high for the short papers. We attribute this to the more complex nature of the short~paper assignments. The criteria used by the TAs to
nized, and interesting ways. They may have understood this to imply aspects of diction and style not captured by the
writingskill categories in Table 2. The use of somewhat different criteria to judge the short
the organization of the short papers. We do not know if the criteria could be successfully expanded to include other
important features of writing without compromising reliabil'
93
Page 4
ity. We suspect it would be difficult. Although the validity coefficient was not as high as we had hoped, judgments of the TAs were in substantial agreement with those of sophis ticated professionals in English. Students received reliable and useful information about at least one major dimension of their writing.
The data show that grading efficiency, the third major con cem of the project, is satisfactory. The total amount of time spent grading about 4,000 words written by a typical student in our introductory course is about 38 min (30 min to score 30
reliability, validity, and cost efficiency. It has functioned well for 3 years, serving almost 600 students under three
different TAs. Student feedback about the course has been solicited each semester, and not one complaint has been
reduced. Competent, committed TAs are essential, but they have not been difficult to find.
References
Boyer, E. L. (1987). College: The undergraduate experience in Amer ica. New York: Harper & Row. Emig, ]. (1977). Writing as a mode of learning. College Composition
and Communication, 28, 122128. Frederiksen, N. (1984). The real test bias. American Psychologist,
39, l93~ZOZ.
Fulwiler, T. (1982). Writing: An act ofcognition. In C. W. Griffin (Ed. ), Teaching writing in all disciplines (pp. 1526), San Francis; co: JosseyeBass. Griffin, C. W. (Ed). (1982). Teaching writing in all disciplines. San Francisco: losseyBass. Holland, M. K. (1985). Usingpsychology (3rd ed.). Boston: Little, Brown. Madigan, R., & Brosamer, J. (1990). Improving the writing skills of students in introductory psychology. Teaching of Psychology,
17, 2730.
Maimon, E. P. (1982). Writing across the curriculum: Past, present, and future. In C. W. Griffin (Ed), Teaching writing in all disciplines (pp. 67~74). San Francisco: lossey-Bass.
Francisco: lossey'Bass.
Willingham, D. B. (1990). Effective feedback on written assign,
chorage, AK 99508.
to test their hypotheses. This article describes the methods used to perform the psychic acts and outlines the structure of the hy~ pothesistesting activity. By allowing students to experience firsthand the importance of the rules of science, this instruc tional method may encourage greater scientific skepticism than Teaching of Psychology