Beruflich Dokumente
Kultur Dokumente
research-article2019
participation in “authentic practice” (Sawyer 2006, 5), itself a significant goal for
learning. Disciplinary reforms of the 1990s already recognized that learning with
understanding is principled, entails mental organizational structures, and enables
transfer across contexts. In recent decades, researchers in the learning sciences
have found that deep learning occurs when students “engage in activities that are
similar to the everyday activities of professionals who work in a discipline”
(Sawyer 2006, 4); and it follows necessarily that this type of learning is specific to
each discipline and community of expertise, rather than involving generic reason-
ing abilities (National Research Council 2012).
For example, the National Research Council (2007) Committee on Science
Learning, Kindergarten through Eighth Grade identified four interconnected
strands of learning needed to achieve scientific proficiency:
1. know, use, and interpret scientific explanations of the natural world (i.e.,
phenomena);
2. generate and evaluate scientific evidence and explanations;
3. understand the nature and development of scientific knowledge; and
4. participate productively in scientific practices and discourse (p. 334).
Traditional learning goals for K–8 science education account for only part of
strands 1 and 3; but even the idea of hypothesis testing in strand 3 now means
being able to conjecture and generate research questions, not just performing
cookbook versions of the “scientific method.”
At the turn of the twenty-first century, policy-makers, politicians, and business
leaders became keen on expanding the definition of learning goals, for different
reasons; but in general, they were concerned about international competitiveness
(as they had been in the 1990s) and the need for a workforce with technological
and analytical thinking skills (National Alliance of Business 2002). In 2009, when
the Common Core State Standards Initiative was launched by the National
Governors Association and the Council of Chief State School Officers (2010a,
2010b), “twenty-first-century skills” became the new buzz phrase (Mathews
2009). As disciplinary researchers became involved in the development of new
standards, a key feature in both English language arts and mathematics, as well
as the Next Generation Science Standards (National Research Council 2013),
was the integration of content strands with disciplinary practices. Also in 2009,
the National Research Council Committee on Defining Deeper Learning and
21st Century Skills was convened to examine research evidence for how such
skills are developed. The committee acknowledged that these types of goals for
learning are not new; skills and abilities such as critical thinking, reasoning and
argumentation, innovation, flexibility, initiative, self-reflection, collaboration, and
communication have always been valued in society, but what may be new is the
expectation that all students develop these abilities (National Research Council
2012). The committee identified three main categories of skills—cognitive,
intrapersonal, and interpersonal—that are inherently intertwined and cannot be
taken apart for separate didactic treatment. In this context, formative assessment
became increasingly valued as a tool to guide student progress.
Classroom Assessment to Support Teaching and Learning 187
strategies and making connections between new problems and work already
mastered.
The research literature on motivation to learn is vast and difficult to distill
because there are numerous theories that overlap with similar (but differently
named) constructs. For example, according to self-determination theory (Deci
and Ryan 1985, 2008) individuals have basic needs for competence, autonomy,
and what they call relatedness. Autonomous or intrinsic motivation involves voli-
tion and choice and leads to productive and successful outcomes. By contrast,
controlled motivation comes from pressure and sources of control outside the
self. These ideas are quite similar to research on goal orientation whereby stu-
dents are said to adopt either mastery or performance goals (Dweck 1986).
Students working toward mastery or task-involved goals are willing to invest
effort in becoming more competent, to tackle more difficult problems, and to
willingly seek new strategies. By contrast, students with performance or an ego-
involved orientation many engage in learning behaviors but mostly to get good
grades or to avoid being judged incompetent.
Closely related to students’ sense of autonomy versus control are beliefs about
the nature of intelligence. As outlined by Dweck (2002), children who believe
that intelligence can be developed are more willing to invest effort in learning,
are more self-confident, and exhibit “hardiness” in the face of setbacks. Currently
popular growth mindset (Dweck 2006) interventions are one example of short-
duration, psychological interventions shown to improve student achievement by
altering beliefs about intelligence, stereotype threat, or other self-conceptions
(Yeager and Walton 2011). These interventions are difficult to scale up, however,
because they can easily devolve to superficial imitations. A better approach is not
to conceive of separate motivational therapies but rather to develop a classroom
culture focused on learning (Shepard 2000), where trajectories for development
of academic, intrapersonal, and interpersonal capabilities are integrated and
mutually supportive (National Research Council 2012). This means, for example,
resisting grading practices and normative comparisons that by definition make
only some children winners.
Figure 1
A Progression-Based Model of a Classroom Learning Culture Connecting Formative
Assessment Practices with Ambitious Teaching
to facilitate learning, teachers must know their students well—not only their personali-
ties and preferences, but also their ideas about subjects and their ways of thinking about
them, including their intellectual habits, misconceptions, and interests. They must
understand the ways in which students’ personal and cultural backgrounds bear on their
work in school and be able to respond with appropriate instructional activities. This
means skillfully eliciting, probing, and analyzing students’ thinking through verbal
interactions and written work. (Ball and Forzani 2011, 20, emphasis added)
Although being able to lead a discussion may seem like a generic skill, for the
most part developing a repertoire of specific teaching strategies implies that most
high-leverage practices are subject-matter specific and, when possible, are grade-
level and community specific. Ball and Forzani (2011), for example, argue that
second grade teachers should be able to probe for understanding of complex
subtraction problems, and middle school English teachers should have strategies
not just for teaching academic English but for helping students to understand
“how and when to use academic English” (p. 21). An extensive team of research-
ers has documented the distinct reading, reasoning, and argumentation practices
that lead to discipline-specific learning goals in literature, science, and history
Classroom Assessment to Support Teaching and Learning 193
(Goldman et al. 2016). In science, Windschitl et al. (2012) created discourse-
based patterns of instruction that included (1) eliciting students’ ideas, which are
then analyzed and used to adapt instruction; (2) guiding student sense-making by
asking them to draw models, share interpretations with classmates, and make
revisions; and (3) pressing students for evidence-based explanations. These ideas
in the core practices literature, especially about eliciting and responding to stu-
dent thinking (McDonald, Kazemi, and Kavanagh 2013), are completely consist-
ent with the formative assessment and learning progression literature, especially
Alonzo’s (2017) ideas about informal learning progressions when formal progres-
sions have yet to be developed.
To reiterate, formative assessment is carried out during the instructional pro-
cess for the purpose of adapting instruction to improve learning (Penuel and
Shepard 2016, 788). It can be conceived of as a set of core practices that intersect
with those intended to support deep learning and participation in disciplinary
discourse practices. More importantly, however, like ambitious teaching, forma-
tive assessment implies an ethos or classroom culture focused on equitable and
collaborative learning. Explicitly focusing classroom activity on what is being
learned was one of the most significant contributions of the Assessment Reform
Group efforts in the United Kingdom in the 1990s. Formative assessment strate-
gies include explicit sharing of learning goals and criteria for judging quality work,
questioning and other classroom routines that make thinking visible, explicit feed-
back plus informal feedback through hearing other students’ ideas, and peer- and
self-assessment. These techniques are important for providing information and for
shifting the nature of classroom interactions, but they are insufficient by them-
selves if there is not a commensurate change in the social meaning of evaluation
(Shepard 2000). Deep learning can only be supported in a cultural context of trust
and respect where students are willing to reveal what they currently understand
with full confidence that talking about ideas will surely lead to new learning.
As elaborated elsewhere, a sociocultural approach to formative assessment
involves more attention being paid to codefining emergent learning goals with
students—goals that explicitly attend to identity and that support students in
navigating between “everyday and disciplinary forms of thinking, being, and
doing” (Penuel and Shepard 2016, 821). Curricular activities are designed to
build from students’ interests, experiences, and funds of knowledge and to focus
on endpoints that are meaningful to students. Assessment processes are embed-
ded in social practices so that they remain authentic to disciplinary expertise. For
example, Windschitl et al. (2012) build students’ content knowledge through
classroom routines whereby students explain their ideas to others, construct and
compare theories, and justify claims, all of which are occasions for peer and self
assessment. Ambitious teaching practices have been shown to be possible and,
when enacted, produce rewarding results; but such practices are neither ordinary
nor routine. I turn now to research evidence on teacher learning to identify the
supports needed to make these kinds of transformations possible.
194 THE ANNALS OF THE AMERICAN ACADEMY
importantly, they should not have to make sense of multiple reforms separately.
Research on formative assessment demonstrates its extraordinary potential to
improve student learning, but not if it is implemented in a way that is at cross-
purposes with the underlying theory of learning. Nor does it make sense to
implement formative assessment as its own intervention. Greater coherence and
greater effectiveness are much more likely if formative assessment professional
development is integrated with ambitious teaching practices and with curricular
reforms in literacy, mathematics, social studies, and so forth; and if professional
learning communities work out explicitly how they will protect the intentions of
formative assessment practices from grading requirements.
In our analysis of formative assessment interventions (Penuel and Shepard
2016), we outlined theories of action as well as theories of learning for each
formative assessment approach. Our theories of action specifically considered
what material resources and what professional opportunities would be needed to
support teachers in taking on new roles. Nearly all sociocognitive interventions
are embedded in curriculum or learning trajectories, which require extensive
development and field testing. These resources are not scripts and require fur-
ther adaptation in local contexts; but because teachers do not have to start from
scratch to devise activities, they can focus more intently on trying out new
instructional strategies, analyzing students’ thinking, and sharing “breakthroughs”
with colleagues, that is, the critical questions, modeling, or peer interactions that
helped students to get past familiar stumbling blocks. For sociocultural interven-
tions, material resources can include effective strategies for eliciting students’
interests, experiences, and funds of knowledge.
thinking and reasoning skills is needed for historical inquiry where students must
be able to contextualize and interpret primary documents and then reconcile
accounts from multiple historical documents (Reisman 2012).
The research-informed model for classroom assessment proposed in this arti-
cle shows how discipline-specific learning progressions or local curricula can be
used to organize both instructional activities and formative assessment processes
so that they build toward intended learning goals represented in summative
assessments. Contemporary research on learning also requires that attention be
paid to the motivational consequences of student evaluation. This means that
educators and policy-makers must be better informed about the kinds of assess-
ment practices that attend to students’ assets from home and community and to
ways of enabling self-regulation versus practices that rely on extrinsic motivation.
It makes little sense for school districts to invest in popular social psychology
interventions, such as growth mindset (Dweck 2002), without recognizing that
district testing, grading, and score-posting requirements have far more pervasive
and long-term effects on students if they are made to see themselves as less than
capable learners.
A central argument made in this article and in prior work (Penuel and Shepard
2016) is that, to be effective, assessment interventions must be designed accord-
ing to a research-based theory of learning. For classroom purposes, both “big”
and “little” theories are required (Shepard, Penuel, and Pellegrino 2018).
Teachers need big-picture understandings of how cognitive competencies such as
critical thinking, reasoning, and problem solving are jointly developed along with
personal competencies such as self-regulation, collaboration, and communication
(National Research Council 2012). “Little” theories refer to fine-grained
models—on the scale of classroom instructional units and lessons—that describe
how knowledge and skills are developed in a particular domain. Enactment of
these theories requires the support of empirically tested learning progressions or
specific curricula designed to reach the ambitious learning goals set by Common
Core State Standards (National Governors Association Center for Best Practices
2010a, 2010b) and Next Generation Science Standards (National Research
Council 2013). Research on teacher learning is clear: the kinds of changes in
teaching practice being called for by new standards are enormous, not just tiny
adjustments. Teachers need material resources and the support of professional
communities to make these changes.
An important point for policy-makers to understand, then, is that local control
of curriculum in the United States makes it unlikely that most states can design
the kinds of coherent, curricular activity systems (Roschelle, Knudsen, and
Hegedus 2010) needed to integrate curriculum, instruction, assessment, and
teacher learning at the classroom level. That’s why districts (or consortia of
smaller districts) would be the appropriate level of authority for the development
and implementation of such systems (Shepard, Penuel, and Pellegrino 2018).
Local control of curriculum also creates a particular problem for conceptualizing
state assessments because accountability tests at the state level must necessarily
be curriculum-neutral; that is, they cannot favor one district’s curriculum over
another. Vertical coherence between classroom and state-level assessments is an
Classroom Assessment to Support Teaching and Learning 197
References
Alonzo, Alicia C. 2017. Models of student cognition as guides for formative assessment, Michigan
Formative Assessment Academy Webinars, Michigan Mathematics and Science Centers Network,
October 16, Ann Arbor, MI.
Ball, Deborah L., and Francesca M. Forzani. 2009. The work of teaching and the challenge of teacher
education. Journal of Teacher Education 60 (5): 497–511.
Ball, Deborah L., and Francesca M. Forzani. 2011. Building a common core for learning to teach and
connecting professional learning to practice. American Educator 35 (2): 17–21, 38–39.
Bertrand, Melanie, and Julie A. Marsh. 2015. Teachers’ sensemaking of data and implications for equity.
American Educational Research Journal 52 (5): 861–93.
Black, Paul, Christine Harrison, Clare Lee, Bethan Marshall, and Dylan Wiliam. 2003. Assessment for
learning: Putting it into practice. Maidenhead, UK: Open University Press.
Black, Paul, and Dylan Wiliam. 1998a. Assessment and classroom learning. Assessment in Education:
Principles, Policy, and Practice 5 (1): 7–74.
Black, Paul, and Dylan Wiliam. 1998b. Inside the black box: Raising standards through classroom assess-
ment. London: Department of Education & Professional Studies, King’s College London.
Braaten, Melissa, Chris Bradford, Kathryn Kirchgasler, and Sadie Fox Barocas. 2017. How data use for
accountability undermines equitable science education. Journal of Educational Administration 55 (4):
427–46.
Brookhart, Susan M. 2018. Learning is the primary source of coherence in assessment. Educational
Measurement: Issues and Practice 37 (1): 35–38.
Butler, Ruth. 1988. Enhancing and undermining intrinsic motivation: The effects of task-involving and
ego-involving evaluation on interest and performance. British Journal of Educational Psychology 58
(1): 1–14.
Calabrese Barton, Angela, Edna Tan, and Ann Rivet. 2008. Creating hybrid spaces for engaging school
science among urban middle school girls. American Educational Research Journal 45 (1): 68–103.
Darling-Hammond, Linda, and Nikole Richardson. 2009. Teacher learning: What matters? Educational
Leadership 66 (5): 46–53.
Deci, Edward L., and Richard M. Ryan. 1985. Intrinsic motivation and self-determination in human
behavior. New York, NY: Plenum Press.
Deci, Edward L., and Richard M. Ryan. 2008. Facilitating optimal motivation and psychological well-
being across life’s domains. Canadian Psychology 49 (1): 14–23.
Dweck, Carol S. 1986. Motivational processes affecting learning. American Psychologist 41 (10): 1040–48.
Dweck, Carol S. 2002. The development of ability concepts. In Development of achievement motivation,
eds. Allan Wigfield and Jacquelynne Eccles, 57–88. San Diego, CA: Academic Press.
Dweck, Carol S. 2006. Mindset: The new psychology of success. New York, NY: Random House.
Goldman, Susan R., M. Anne Britt, Willard Brown, Gayle Cribb, MariAnne George, Cynthia Greenleaf,
Carol D. Lee, Cynthia Shanahan, and Project READI. 2016. Disciplinary literacies and learning to
Classroom Assessment to Support Teaching and Learning 199
read for understanding: A conceptual framework for disciplinary literacy. Educational Psychologist 51
(2): 219–46.
Graesser, Arthur C., Danielle S. McNamara, and Kurt Van Lehn. 2005. Scaffolding deep comprehension
strategies through Point & Query, AutoTutor, and iSTART. Educational Psychologist 40 (4): 225–34.
Guskey, Thomas R. 2009. Practical solutions for serious problems in standards-based grading. Thousand
Oaks, CA: Corwin Press.
Hattie, John, and Helen Timperley. 2007. The power of feedback. Review of Educational Research 77 (1):
81–112.
Herman, Joan L. 2010. Coherence: Key to next generation assessment success. Los Angeles, CA: CRESST.
Herman, Joan. L. 2013. Formative assessment for next generation science standards: A proposed model.
Princeton, NJ: Educational Testing Service.
Holland, Dorothy, and Jean Lave. 2009. Social practice theory and the historical production of persons.
Action: An International Journal of Human Activity Theory 2:1–15.
Kluger, Avraham N., and Angelo DeNisi. 1996. The effects of feedback interventions on performance: A
historical review, a meta-analysis, and a preliminary feedback intervention theory. Psychological
Bulletin 119 (2): 254–84.
Lampert, Magdalene, and Filippo Graziani. 2009. Instructional activities as a tool for teachers’ and teacher
educators’ learning in and for practice. Elementary School Journal 109 (5): 491–509.
Lave, Jean, and Etienne Wenger. 1991. Situated learning: Legitimate peripheral participation. Cambridge:
Cambridge University Press.
Lepper, Mark R., and Maria Woolverton. 2002. The wisdom of practice: Lessons learned from the study
of highly effective tutors. In Improving academic achievement: Impact of psychological factors on
education, edited by Josua Aronson, 135–58. San Diego, CA: Academic.
Mathews, Jay. 5 January 2009. The rush for “21st-century skills.” The Washington Post.
McDonald, Morva, Elham Kazemi, and Sarah S. Kavanagh. 2013. Core practices and pedagogies of
teacher education: A call for a common language and collective activity. Journal of Teacher Education
64 (5): 378–86.
McMillan, James H. 2001. Secondary teachers’ classroom assessment and grading practices. Educational
Measurement: Issues and Practice 20 (1): 20–32.
Mislevy, Robert J. 2019. Advances in measurement and cognition. The ANNALS of the American Academy
of Political and Social Science (this volume).
Moll, Luis C., Cathy Amanti, Deborah Neff, and Norma Gonzalez. 1992. Funds of knowledge for teach-
ing: Using a qualitative approach to connect homes and classrooms. Theory into Practice 31 (2): 132–
41.
National Alliance of Business. 2002. A nation of opportunity: Building America’s 21st century workforce.
Washington, DC: Department of Labor.
National Governors Association Center for Best Practices, and Council of Chief State School Officers.
2010a. Common Core State Standards for English Language Arts. Washington, DC: National
Governors Association Center for Best Practices, and Council of Chief State School Officers.
National Governors Association Center for Best Practices, and Council of Chief State School Officers.
2010b. Common Core State Standards for Mathematics. Washington, DC: National Governors
Association Center for Best Practices, and Council of Chief State School Officers.
National Research Council. 2000. How people learn: Brain, mind, experience, and school, eds. John D.
Bransford, Ann L. Brown, and Rodney R. Cocking. Committee on Developments in the Science of
Learning, Commission on Behavioral and Social Sciences and Education, National Research Council.
Washington, DC: National Academies Press.
National Research Council. 2001. Knowing what students know: The science and design of educational
assessment, eds. James W. Pellegrino, Naomi Chudowsky, and Robert Glaser. Committee on the
Foundations of Assessment. Board on Testing and Assessment, Center for Education. Division of
Behavioral and Social Sciences and Education. Washington, DC: National Academies Press.
National Research Council. 2007. Taking science to school: Learning and teaching science in grades K-8,
eds. Richard A. Duschl, Heidi A. Schweingruber, and Andrew W. Shouse. Committee on Science
Learning, Kindergarten through Eighth Grade. Board on Science Education, Center for Education.
Division of Behavioral and Social Sciences and Education. Washington, DC: National Academies
Press.
200 THE ANNALS OF THE AMERICAN ACADEMY
National Research Council. 2012. Education for life and work: Developing transferable knowledge and
skills in the 21st century, eds. James W. Pellegrino and Maragaret L. Hilton. Board on Testing and
Assessment and Board on Science Education, Division of Behavioral and Social Sciences in Education,
edited by Committee on Defining Deeper Learning and 21st Cenury Skills. Washington, DC: National
Academies Press.
National Research Council. 2013. Next Generation Science Standards: For states, by states. Washington,
DC: National Academies Press.
National Research Council. 2014. Developing assessments for the Next Generation Science Standards.
Washington DC: National Academies Press.
Organisation for Economic Co-operation and Development (OECD). 2005. Formative assessment:
Improving learning in secondary classrooms. Paris: OECD.
Penuel, William R., and Lorrie A. Shepard. 2016. Assessment and teaching. In Handbook of research on
teaching, 5th ed., eds. Drew H. Gitomer and Courtney A. Bell, 787–850. Washington, DC: American
Educational Research Association.
Pintrich, Paul R. 2000. Multiple goals, multiple pathways: The role of goal orientation in learning and
achievement. Journal of Educational Psychology 92 (3): 544–55.
Putnam, Ralph T., and Hilda Borko. 2000. What do new views of knowledge and thinking have to say about
research on teacher learning? Educational Researcher 29 (1): 4–15.
Reese, William J. 1999. Testing wars in the public schools: A forgotten history. Cambridge, MA: Harvard
University Press.
Reisman, Avishag. 2012. Reading like a historian: A document-based history curriculum intervention in
urban high schools. Cognition and Instruction 30 (1): 86–112.
Rogoff, Barbara. 1997. Evaluating development in the process of participation: Theory, methods, and
practice building on each other. In Change and development: Issues of theory, application, and method,
eds. Erie Amsel, and Ann Renninger, 265–85. Hillsdale, NJ: Erlbaum.
Roschelle, Jeremy, Jennifer Knudsen, and Stephen Hegedus. 2010. From new technological infrastruc-
tures to curricular activity systems: Advanced designs for teaching and learning. In Designs for learning
environments of the future: International perspectives from the learning sciences, eds. Michael
Jacobson and Peter Reimann, 233–62. New York, NY: Springer.
Sawyer, R. Keith, ed. 2006. The Cambridge handbook of the learning sciences. New York, NY: Cambridge
University Press.
Shepard, Lorrie A. 2000. The role of assessment in a learning culture. Educational Researcher 29 (4): 4–14.
Shepard, Lorrie A. 2006. Classroom assessment. In Educational measurement, 4th ed., ed. Robert L.
Brennan, 623–46. Westport, CT: Praeger.
Shepard, Lorrie A. 2008. Formative assessment: Caveat emptor. In The future of assessment: Shaping
teaching and learning, ed. Carol Anne Dwyer, 279–303. New York, NY: Lawrence Earlbaum.
Shepard, Lorrie A. 2018. Learning progressions as tools for assessment and learning. Applied Measurement
in Education 31 (2): 165–74.
Shepard, Lorrie A., William R. Penuel, and James W. Pellegrino. 2018. Using learning and motivation
theories to coherently link formative assessment, grading practices, and large-scale assessment.
Educational Measurement: Issues and Practice 37 (1): 21–34.
Shute, Valerie J. 2008. Focus on formative feedback. Review of Educational Research 78 (1): 153–89.
U.S. Congress, Office of Technology Assessment. 1992. Testing in American schools: Asking the right ques-
tions. OTA-SET-519. Washington, DC: U.S. Government Printing Office.
Vinovskis, Maris A. 2019. History of testing in the United States: PK–12 education. The ANNALS of the
American Academy of Political and Social Science (this volume).
Windschitl, Mark, Jessica Thompson, Melissa Braaten, and David Stroupe. 2012. Proposing a core set of
instructional practices and tools for teachers of science. Science Education 96 (5): 878–903.
Yeager, David S., and Gregory M. Walton. 2011. Social-psychological interventions in education: They’re
not magic. Review of Educational Research 81 (2): 267–301.