Natriello 1987
Educational Psychologist
To cite this article: Gary Natriello (1987) The Impact of Evaluation Processes on Students, Educational Psychologist, 22:2,
155-175, DOI: 10.1207/s15326985ep2202_4
EDUCATIONAL PSYCHOLOGIST, 22(2), 155-175
Copyright © 1987, Lawrence Erlbaum Associates, Inc.
Requests for reprints should be sent to Gary Natriello, Teachers College, Columbia University, Box 85, New York, NY 10027.
and classrooms by (a) briefly reviewing a conceptual framework for thinking
about the evaluation process; (b) examining research on the impact of fea-
tures of the evaluation process on students, with particular emphasis on alter-
able elements of the process; and (c) considering the limitations of previous
research and directions for future research.
of the evaluation process that must be attended to by evaluators and that may
have an impact on students. Brief consideration of each of these stages sug-
[Figure: the eight-stage model of the evaluation process for student performance; original artwork not recoverable.]
The first stage in the model is establishing purposes for evaluating students,
which suggests that there are multiple purposes for the evaluation of student
performance. Although there are a number of brief discussions of the pur-
poses of evaluation in many texts on measurement and evaluation (e.g.,
Ahmann & Glock, 1967; Lien, 1967; Remmers, Gage, & Rummel, 1960), the
purposes of evaluation receive scant attention in the literature.
Discussions of the purposes of evaluation of student performance suggest
that there are four generic functions that evaluation processes are thought to
serve: certification, selection, direction, and motivation. Certification refers
to the assurance that a student has attained a certain level of accomplishment
or mastery. Selection entails the identification of students or groups of stu-
The third stage moves beyond the general assignment of a task to provide in-
formation on the properties of the task that will be considered important in
the evaluation of performance. Although there is little discussion of task-
specific criteria for evaluation in the evaluation literature, attention has been
devoted to the types of criteria employed in the evaluation process. It is gen-
erally accepted that the achievement of students in a subject is the one crite-
rion common to all evaluation systems in schools and classrooms (Brown,
1971). There is little discussion as to the appropriateness of using achieve-
ment criteria, though in recent years there has been increased attention de-
voted to determining whether the evaluation process is actually linked to the
instructional process (Linn, 1983; Rudman et al., 1980) so that students are
not placed in a position where they are evaluated on things not covered in the
instructional program (Natriello, 1982). Although there is agreement that
types of criteria other than achievement criteria enter into evaluation proces-
ses in schools and classrooms (Thorndike, 1969), there is less agreement as to
which of these other types of criteria, such as participation, effort, and con-
duct (Natriello & McPartland, in press; Salganik, 1982; Schunk, 1983;
Weiner, 1979), may be appropriate.
The fourth stage in the evaluation process has received considerable atten-
tion amidst renewed calls for higher standards in U.S. schools (National
The fifth stage in the evaluation process involves the collection of partial in-
formation on student performance of assigned tasks and the outcomes of
those performances. The collection of such information requires a sampling
process because it would be impractical, if not impossible, to collect total in-
formation on student performance. Most of the important decisions about
the collection of performance information thus involve sampling decisions to
insure that the information collected provides a valid and reliable estimate of
performance appropriate to the purposes, tasks, criteria, and standards that
have already been determined.
By far the dominant technique for collecting information on student per-
formance is some form of testing. A number of analysts have contributed im-
portant observations about the relationship between testing practices and the
purposes, tasks, criteria, and standards for the evaluation of students. For
example, Deutsch (1979) objected to the overwhelming use of tests em-
ploying norm-referenced standards for the purpose of selection at the ex-
pense of student motivation and individual development. Others have re-
jected norm-referenced tests in favor of criterion-referenced tests for the
purpose of certification (Glaser, 1963; Hambleton, Swaminathan, Algina, &
Coulson, 1978; Popham & Husek, 1969). The relationship between tests and
assigned tasks and the biases that result when tests do not correspond to the
curriculum have also been given serious attention (Leinhardt & Seewald,
1981; National Institute of Education, 1979; Rudman et al., 1980). Descrip-
tive accounts of testing reveal a wide range of testing practices, and the use of
tests from various sources for multiple purposes together with evidence that
the level of expertise for test construction among teachers may be quite low
(Gullickson, 1982, 1984; Herman & Dorr-Bremme, 1984; Natriello, 1982).
Alternatives to traditional testing have also been examined, including
routine class and homework assignments, classroom interaction during
question-and-answer sessions, recitations, discussions, oral reading, prob-
lem solving at the chalkboard, special projects, presentations, and reports
(Gaston, 1976; Heller, 1978; Herman & Dorr-Bremme, 1984). Although such
practices appear to broaden the base of information on student performance,
there are serious questions about the quality of the information they provide
(Rudman et al., 1980).
The seventh stage of the model involves the communication of the results of
the evaluation to relevant parties, including the student, parents, school offi-
cials, and potential employers (Ahmann & Glock, 1967). Designating feed-
back as a distinct stage serves to underline the point that a good deal of
evaluative information is never communicated to performers or other rele-
vant parties.
The nature and extent of communications regarding student performance
have been the subject of various investigations and commentaries. Some of
these have focused on the various forms of feedback from traditional report
cards (Chansky, 1975; Jarrett, 1963; Walling, 1975) to checklists (Rudman,
1978) to graded tests (Gullickson, 1982) to conferences (Ediger, 1975; Natri-
ello, 1982). Other investigations have considered the relationship of feedback
techniques to other dimensions of the evaluation process. The relationship
between the type of feedback and the purpose of the evaluation process has
received the attention of numerous authors (Cross & Cross, 1980; Hansen,
1977; Lissman & Paetzold, 1983; Oren, 1983; Slavin, 1978). Several investigators have considered the relationship between task characteristics and the
nature of feedback (Lintner & Ducette, 1974; Lissman & Paetzold, 1983).
Finally, the eighth stage involves consideration of the impact of the evalua-
tion process in light of the original purposes of the process. The purposes of
certification, selection, direction, and motivation might suggest an analysis
of mastery, classification, progress, and continued engagement, respec-
tively. This eighth stage of the evaluation process leads back to the first stage
of establishing or reestablishing the purpose for evaluating students as the cy-
cle continues.
Of course, the eight stages of the model are an oversimplification of reality. It might be argued that later stages of the model have an impact on earlier
stages. For instance, some would argue that the criteria and standards set for
a task really define the task assignment or that the constraints of the sampling
process help to define the real criteria and standards. Moreover, although the
first six stages of the model are portrayed as having an impact on outcomes
only through the mediation of the feedback process designated as the seventh
stage, in reality each stage of the process may have direct effects on the out-
comes of the evaluation process, as is shown in the next section. Limitations
such as these notwithstanding, the model does highlight some key elements of
evaluation processes and provides a set of broad categories within which to
consider the impact of various features of evaluation processes on students.
Purposes of Evaluations
fined outcome (e.g., Lissman & Paetzold, 1983; Schunk, 1983; Williams,
Pollack, & Ferguson, 1975). As a result, little is known about the develop-
ment and implementation of evaluation systems in school and classroom
contexts where evaluation must serve multiple purposes.
Resolution of Tasks
other, and less student autonomy to choose tasks) there was higher concur-
rence among classmates, between self and classmates, between teacher and
classmates, and between self and teacher in ratings of reading ability.
Rosenholtz and Rosenholtz (1981) found that these same high-resolution
classroom structures led to more dispersed evaluations of reading ability by
students themselves, by classmates, and by teachers. In addition, they also
found that low-resolution classroom structures diminished the effect of
teacher evaluation on peer evaluations of an individual's reading ability. In a
study of third-grade classrooms, Simpson (1981) found that low levels of cur-
ricular differentiation led to " . . . a more nearly normal distribution of self-
reports of ability by increasing the proportion of students reporting ability
levels below average and far below average" (p. 127). Moreover, low curricu-
lar differentiation also appeared to lead to a more generalized view of aca-
demic ability, to greater peer consensus about students' performance levels,
and to greater influence for peers on an individual's self-reported ability.
Clarity of Criteria
Dornbusch and Scott (1975) made the point that criteria add to the
definition of the assigned task and direct the attention of performers to the
key elements of the task for which they will be held accountable. Schunk
(1983) reported on a study in which some children were offered rewards for
participating in a task, others were offered rewards for careful work on the
task, and still others were not offered rewards until they had completed the
task. The results indicated that the first group of children, those who had re-
ceived both a task assignment and information on the criteria for perform-
ance, showed the highest levels of skill, self-efficacy, and rapid problem
solving.
Natriello (1982) found that over 30% of the students in his study of four sub-
urban high schools reported that they had received unsatisfactory evalua-
tions because they had misunderstood the criteria by which they were to be
evaluated. Smith (1984) observed that clarity has been demonstrated to be an
important component of teaching in research on teaching effectiveness
(Rosenshine & Furst, 1973). In his study of the impact of teacher "use of un-
Demandingness of Standards
Referents of Standards
The impact of different types of standards has also been investigated. Per-
haps the most attention has been devoted to norm-referenced standards or
"grading on the curve." Michaels (1977) designated the reward structure associated with this practice as "individual competition, in which grades are assigned to students based on their performances relative to those of their classmates" and distinguished it from "individual reward contingencies, in which
grades are assigned to students on the basis of how much material each stu-
dent apparently masters" (p. 87). He considered the effects of these two re-
ward structures along with two other reward structures (group competition
and group reward contingencies) on student academic performance. In re-
viewing the relevant literature, he concluded that individual competition
consistently produces superior academic performance.
However, Michaels (1977) observed that the superior academic perform-
ance found to be associated with individual competition may be limited to the
top third of the class, to those students who are most responsive to the reward
structure. Deutsch (1979) presented a more critical analysis of individual
competition or grading on the curve, a situation he described as an artificially
created shortage of good grades. He argued that the "disappointing rewards,
induced by an artificial scarcity, are likely to hamper the development of edu-
cational merit and the sense of one's own value" (p. 394). Moreover, under
individual competition, "students are more anxious, they think less well of
themselves and of their work, they have less favorable attitudes toward their
classmates and less friendly relations with them, and they feel less of a sense
of responsibility toward them" (p. 399).
In considering the impact of individual competition and individual reward
contingencies on actual student performance, Deutsch disagreed with the
conclusions reached by Michaels. Examining the same studies examined by
Michaels, Deutsch (1979) concluded that a number of these studies were
flawed because they did not equate the objective probability of reward in the
reward structures being compared. Deutsch's reanalysis of these studies
showed "no systematic differences in performance on isolated work under
several different reward systems" (p. 398). This position was confirmed by
Williams et al. (1975), who found no significant differences between the
achievement and self-reported attitudes or school-related behavior of stu-
dents exposed to norm-referenced and criterion-referenced standards. How-
ever, they also found that criterion-referenced standards provided assurance
to students who performed poorly initially that enabled at least some of them
to increase their performance on later tests, and that criterion-referenced
standards allowed students who did well initially to become confident and
work less than students working under a norm-referenced system.
Norm-referenced standards have also been compared to individually refer-
enced standards for their effects on student performance. Slavin (1980)
found that students in classes in which evaluations were based on experimen-
tal individually referenced standards achieved more on a final standardized
test than students in control classes evaluated by norm-referenced standards.
However, Beady, Slavin, and Fennessey (1981) found no differences in the
effects of norm-referenced standards and individually referenced standards
Frequency of Sampling
Soundness of Appraisals
flect their effort and performance), they were less likely to consider these
evaluations important and less likely to devote effort to the associated tasks.
An interesting complication of these effects is found in work on the theory
of learned helplessness, which suggests that experiencing uncontrollable out-
comes should depress performance (Abramson, Seligman, & Teasdale, 1978;
Seligman, 1975), as well as in work that suggests that the experience of
uncontrollable outcomes facilitates increased performance by producing an
increased need for control (Roth & Bootzin, 1974; Thornton & Jacobs,
1972). An integrative model developed by Wortman and Brehm (1975) sug-
gests that brief exposure to uncontrollable outcomes will lead to improved
performance, whereas extended exposure will lead to decreased perform-
ance. Research involving high school students (Buys & Winfield, 1982) re-
veals only decreased student performance in reaction to the experience of
uncontrollable outcomes, a pattern the authors link to the relatively less self-
reliant and less self-confident nature of high school students compared to
Differentiation of Feedback
dom allow students to choose their tasks, there is a higher dispersion among
students' reported ability levels, higher generalization of students' reported
ability levels, higher peer consensus as to students' relative performance lev-
els, and greater peer influence over students' reported ability levels. Thus, the
use of less differentiated forms of feedback such as grades seems to lead to
more pronounced and more powerful ability stratification processes in the
classroom.
A similar effect on the distribution of attributional tendencies in class-
rooms was found by Oren (1983). Oren explored the effects of evaluation
feedback on the attributional tendencies of students. Results indicated that in
classrooms with differentiated, specific, and individualized feedback, the
attributional tendencies of low achievers were more like those of high achiev-
ers. Specifically, low achievers in such classrooms scored higher on internal
control than low achievers in classrooms with less differentiated feedback
systems.
The affective value of feedback has also been shown to influence attribu-
tions in classrooms. Meyer et al. (1979) reported on a series of six experimen-
tal studies that investigated the extent to which praise and criticism in re-
sponse to task performance provided information about others' perceptions
of a focal actor's ability. In these studies, subjects were presented with de-
scriptions of two students who had obtained identical results at a task. One of
the students received neutral feedback while the other was praised for success
or criticized for failure. Studies using adult subjects revealed that praise after
success and neutral feedback after failure led to the perception that the focal
actor's ability was low, and neutral feedback for success and criticism after
failure led to the perception that the focal actor's ability was high. However,
these findings varied by the age of the respondents. For example, third-grade
students believed that the student praised by the teacher was the brighter one;
students in Grades 4 to 7 selected the praised student and the student receiv-
ing neutral feedback in approximately equal numbers; and students in
Grades 8 and above believed that the student receiving neutral feedback was
brighter than the one receiving positive feedback following successful per-
formance. Although the effects of feedback in the classroom appear to be
powerful, they are multidimensional and complex. Simple injunctions to in-
crease feedback for one purpose or another are likely to set in motion a range
of processes that are in need of further examination.
Although the studies of the effects of features of the evaluation process just
noted have suggested some possible consequences for certain individual fea-
tures, little attention has been devoted to developing an understanding of en-
tire evaluation systems composed of purposes, tasks, criteria, standards,
samples, appraisals, and feedback. One of the key issues to be examined in
thinking about systems of evaluation is the relationship between various as-
pects of the process and the extent to which there is consistency among them.
For instance, evaluations and evaluation systems may differ in terms of the
consistency between task assignments and criteria set for the task. Some
teachers may take care that the performance criteria set for a task be appro-
priate to the nature of the task assignment but others may not. In the latter
case a teacher may designate a task as a creative opportunity when an assign-
ment is made but hold students accountable for a formulaic set of criteria. A
second instance might be the consistency between the criteria and standards
set for the task and the process of sampling student performances and out-
comes. For example, a teacher may specify criteria related to the actual per-
formance of the task (e.g., how to proceed to solve a math problem) but only
sample the outcome of the performance (e.g., the correctness of the answer).
Although little research has been conducted to examine the actual extent to
which teachers implement a consistent system of performance evaluation for
students, interviews conducted by Natriello (1982) with secondary school
teachers suggest that teachers vary widely in their ability to articulate a sys-
tematic approach to the evaluation of student performance. Examinations of
teacher preparation curricula, which indicate that prospective teachers re-
ceive little or no training in the evaluation of student performance (Mayo,
1967; Roeder, 1973), suggest that this finding may be widely applicable. The
effects of this lack of consistency could be quite negative. Natriello (1982) re-
ported that high school students who experienced more inconsistencies in the
evaluation system were also more likely to become disengaged from school.
suggest that such standards may not be used extensively by teachers at the
present time (Natriello & McPartland, in press).
Second, most of the effects studies concentrate on one or two aspects of
the evaluation process. As a result, they fail to consider the impact of other
key elements in determining the effects of evaluations. The conclusions drawn from such studies seldom consider the nature of the assigned tasks upon which students are being evaluated, yet it is clear that task differences condition the impact of evaluation processes (Doyle, 1983).
Third, few of the effects studies consider the multiple purposes for evalua-
tions in schools and classrooms. As a result, they often compare different
evaluation methods in terms of some outcome that has nothing to do with the
purpose for which one of the methods was developed. For instance, a study
demonstrating that differentiated feedback contributes more to directing fu-
ture student performance than a single letter grade may be doing nothing
more than showing that an evaluation system created for the purpose of pro-
viding direction to students does a better job of providing that direction than
another evaluation system created for the purpose of selecting students.
The limitations of previous studies of the impact of evaluation processes
on students suggest important directions for further research. Research is
needed on the basic patterns of evaluation practices in schools and class-
rooms. Investigators have typically begun with some common assumptions
about the current state of practice as they planned intervention studies of
evaluation processes. However, additional research is needed to provide a
better descriptive account of how students are currently evaluated in schools
and classrooms.
Research on evaluation practices in schools and classrooms will need to
consider explicitly which of the multiple purposes of evaluation processes can
be served by which combinations of practices. For example, previous re-
search suggests that the design of an evaluation system for the purpose of
enhancing student motivation might involve a differentiated task structure in
the classroom, a mix of more and less predictable tasks, clearly articulated criteria, challenging yet attainable self-referenced standards, relatively frequent collection of information on student performance, appraisals that
truly reflect student effort and performance, and differentiated and encour-
aging feedback. An evaluation system designed for purposes of certification
would look quite different. Researchers should be sensitive to the purposes of
evaluation systems when they examine existing evaluation arrangements,
which typically involve compromises among the competing demands of mul-
tiple purposes. They should also be aware of the multiple purposes served by
evaluation systems when they design interventions to achieve certain pur-
poses at the expense of neglecting other purposes that must be attended to in
operating schools and classrooms.
Research on evaluation practices might be improved considerably if inves-
REFERENCES
Abramson, L. Y., Seligman, M. E. P., & Teasdale, J. D. (1978). Learned helplessness in
humans: Critique and reformulation. Journal of Abnormal Psychology, 87, 49-74.
Ahmann, J. S., & Glock, M. D. (1967). Evaluating pupil growth: Principles of tests and measurements (4th ed.). Boston: Allyn & Bacon.
Armbruster, B. B., Stevens, R. J., & Rosenshine, B. (1977). Analyzing content coverage and emphasis: A study of three curricula and two tests (Tech. Rep. No. 26). Urbana: University of Illinois, Center for the Study of Reading.
Atkinson, J. W. (1958). Towards experimental analysis of human motivation in terms of motives, expectancies, and incentives. In J. W. Atkinson (Ed.), Motives in fantasy, action and society (pp. 273-306). Princeton, NJ: Van Nostrand.
Beady, C. J., Jr., Slavin, R. E., & Fennessey, G. M. (1981). Alternative student evaluation structures and a focused schedule of instruction in an inner-city junior high school. Journal of Educational Psychology, 75, 518-523.
Bidwell, C. E. (1965). The school as a formal organization. In J. G. March (Ed.), Handbook of organizations (pp. 972-1022). Chicago: Rand McNally.
Bolocofsky, D. N., & Mescher, S. (1984). Student characteristics: Using student characteristics
to develop effective grading practices. The Directive Teacher, 6, 11-23.
Bresee, C. W. (1976). On "grading on the curve." The Clearing House, 5, 108-110.
Brookover, W. B., & Schneider, J. M. (1975). Academic environments and elementary school achievement. Journal of Research and Development in Education, 9, 82-91.
Brophy, J., & Evertson, C. (1981). Student characteristics and teaching. New York: Longman.
Brown, D. J. (1971). Appraisal procedures in the secondary schools. Englewood Cliffs, NJ:
Prentice-Hall.
Buys, N., & Winfield, A. H. (1982). Learned helplessness in high school students following
experience of noncontingent rewards. Journal of Research in Personality, 6, 6-9.
Chansky, N. M. (1975). A critical examination of school report cards from K through 12. Reading Improvement, 12, 184-192.
Crano, W. D., & Mellon, P. M. (1978). Causal influence of teachers' expectations on children's
academic performance: A cross-lagged panel analysis. Journal of Educational Psychology,
70, 39-49.
Crooks, A. D. (1933). Marks and marking systems: A digest. Journal of Educational Research, 27, 259-272.
Cross, L. J., & Cross, C. M. (1980). Teachers' evaluative comments and pupil perception of control. Journal of Experimental Education, 49, 68-71.
Davis, R. G., & McKnight, C. (1976). Conceptual, heuristic, and S-algorithmic approaches in mathematics teaching. Journal of Children's Mathematical Behavior, 1(Suppl. 1), 271-286.
Deutsch, M. (1979). Education and distributive justice: Some reflections on grading systems.
American Psychologist, 34, 391-401.
Dornbusch, S. M., & Scott, W. R. (1975). Evaluation and the exercise of authority. San
Francisco: Jossey-Bass.
Doyle, W. (1983). Academic work. Review of Educational Research, 53, 159-199.
Ediger, M. (1975). Reporting pupil progress: Alternatives to grading. Educational Leadership,
32, 265-267.
Egan, O., & Archer, P. (1985). The accuracy of teachers' ratings of ability: A regression model. American Educational Research Journal, 22, 25-34.
Evans, E. D., & Engelberg, R. A. (1985, April). A developmental study of student perceptions of school grading. Paper presented at the biennial meeting of the Society for Research on Child Development, Toronto.
Feldhusen, J. F. (1964). Student perceptions of frequent quizzes and post-mortem discussions of tests. Journal of Educational Measurement, 1, 51-54.
Gaston, N. (1976). Evaluation in the affective domain. Journal of Business Education, 52,
134-136.
Glaser, R. (1963). Instructional technology and the measurement of learning outcomes. American Psychologist, 18, 519-521.
Glass, G. V. (1978). Standards and criteria. Journal of Educational Measurement, 15, 237-261.
Gullickson, A. R. (1982). The practice of testing in elementary and secondary schools. Unpub-
Natriello, G. (1985). Merit pay for teachers: The implications of theory for practice. In H. C. Johnson, Jr. (Ed.), Merit, money and teachers' careers (pp. 99-120). Lanham, MD: University Press of America.
Natriello, G., & Dornbusch, S. M. (1984). Teacher evaluative standards and student effort. New York: Longman.
Natriello, G., & McDill, E. L. (1986). Performance standards, student effort on homework and academic achievement. Sociology of Education, 59, 18-31.
Natriello, G., & McPartland, J. (in press). Adjustments in high school teachers' grading criteria: Accommodation or motivation? Baltimore: Johns Hopkins University, Center for the Social Organization of Schools.
Oren, D. L. (1983). Evaluation systems and attributional tendencies in the classroom: A socio-
logical approach. Journal of Educational Research, 76, 307-312.
Page, E. B. (1958). Teacher comments and student performance: A seventy-four classroom experiment in school motivation. Journal of Educational Psychology, 49, 173-181.
Peckham, P. D., & Roe, M. D. (1977). The effects of frequent testing. Journal of Research and Development in Education, 10, 40-50.
Popham, W. J., & Husek, T. R. (1969). Implications of criterion-referenced measurement.
Journal of Educational Measurement, 6, 1-9.
Purkey, S. C., & Smith, M. S. (1983). Effective schools: A review. The Elementary School Journal, 83, 427-452.
Remmers, H. H., Gage, N. L., & Rummel, J. F. (1960). A practical introduction to measurement and evaluation. New York: Harper & Brothers.
Rheinberg, F. (1983). Achievement evaluation: A fundamental difference and its motivational
consequences. Studies in Educational Evaluation, 9, 185-194.
Roeder, H. H. (1973). Teacher education curriculum-Your final grade is F. Journal of Educational Measurement, 10, 141-143.
Rosenholtz, S. J., & Rosenholtz, S. H. (1981). Classroom organization and the perception of ability. Sociology of Education, 54, 132-140.
Rosenholtz, S. J., & Wilson, B. (1980). The effect of classroom structure on shared perceptions of ability. American Educational Research Journal, 17, 75-82.
Rosenshine, B., & Furst, N. (1973). The use of direct observation to study teaching. In R. M. W.
Travers, (Ed.), Second handbook of research on teaching (pp. 122-183). Chicago: Rand
McNally.
Rosenthal, R., & Jacobson, L. (1968). Pygmalion in the classroom. New York: Holt, Rinehart &
Winston.
Rosswork, S. G. (1977). Goal setting: The effects on an academic task with varying magnitudes
of incentive. Journal of Educational Psychology, 69, 710-715.
Roth, S., & Bootzin, R. R. (1974). The effects of experimentally induced expectancies of external
control: An investigation of learned helplessness. Journal of Personality and Social Psychol-
ogy, 28, 253-264.
Rudman, H. C., Kelly, J. L., Wanous, D. S., Mehrens, W. A., Clark, C. M., & Porter, A. C.
(1980). Integrating assessment with instruction: A review (1922-1980). East Lansing, MI:
Michigan State University, College of Education, Institute for Research on Teaching.
Rudman, M. K. (1978). Evaluating students: How to do it right. Learning, 7, 50-53.
Salganik, L. H. (1982, March). The effects of effort marks on report card grades. Paper pre-
sented at the annual meeting of the American Educational Research Association, Los
Angeles.
Schunk, D. H. (1983). Reward contingencies and the development of children's skills and self-
efficacy. Journal of Educational Psychology, 75, 511-518.
Seligman, M. E. P. (1975). Helplessness: On depression, development, and death. San
Francisco: Freeman.
Simpson, C. (1981). Classroom structure and the organization of ability. Sociology of Educa-
tion, 54, 120-132.
Slavin, R. E. (1977). Classroom reward structure: An analytical and practical review. Review of
Educational Research, 47, 633-650.
Slavin, R. E. (1978). Separating incentives, feedback, and evaluation: Toward a more effective
classroom system. Educational Psychologist, 13, 97-100.
Slavin, R. E. (1980). Effects of individual learning expectations on student achievement. Journal
of Educational Psychology, 72, 520-524.
Smith, L. R. (1984). Effect of teacher vagueness and use of lecture notes on student perform-
ance. Journal of Educational Research, 78, 68-74.
Stewart, L. G., & White, M. A. (1976). Teacher comments, letter grades and student perform-
ance: What do we really know? Journal of Educational Psychology, 68, 488-500.
Terwilliger, J. G. (1977). Assigning grades-Philosophical issues and practical recommenda-
tions. Journal of Research and Development in Education, 10, 21-39.
Thompson, J. D. (1967). Organizations in action. New York: McGraw-Hill.
Thorndike, R. L. (1969). Marks and marking systems. In R. L. Ebel (Ed.), Encyclopedia of edu-
cational research (pp. 759-766). New York: Macmillan.
Thornton, J. W., & Jacobs, P. D. (1972). The facilitating effects of prior inescapable/unavoida-
ble stress on intellectual performance. Psychonomic Science, 26, 265-271.
Varenne, H., & Kelly, M. (1976). Friendship and fairness: Ideological tensions in an American
high school. Teachers College Record, 77, 601-614.
Waller, W. (1932). The sociology of teaching. New York: Wiley.
Walling, D. R. (1975). Designing a "report card" that communicates. Educational Leadership,
32, 258-260.
Weiner, B. (1979). A theory of motivation for some classroom experiences. Journal of Educa-
tional Psychology, 71, 3-25.
Williams, R. G., Pollack, M. J., & Ferguson, N. A. (1975). Differential effects of two grading
systems on student performance. Journal of Educational Psychology, 67, 253-258.
Wilson, S. (1976). You can talk to teachers: Student-teacher relations in an alternative high
school. Teachers College Record, 78, 77-100.
Wise, R. I., & Newman, B. (1975). The responsibilities of grading. Educational Leadership, 32,
253-256.
Wortman, C. B., & Brehm, J. W. (1975). Responses to uncontrollable outcomes: An integration
of reactance theory and the learned helplessness model. In L. Berkowitz (Ed.), Advances in ex-
perimental social psychology (Vol. 8, pp. 278-336). New York: Academic Press.