Establishing Validity for Performance-Based Assessments: An Illustration for Collections of Student Writing

Abstract
Techniques for establishing the reliability and validity of assessments of student writing are presented. Raters scored collections of elementary students' narrative writing using the holistic scales of two rubrics: a new rubric designed for classroom use and known to enhance teacher practice, and an established rubric for large-scale writing assessment. Score reliabilities were compared using three methods: percentage agreement, correlations between rater pairs, and generalizability studies. Evidence for the validity of scores was compared on the basis of (a) correlations of scores with results from two other methods of writing assessment, (b) developmental patterns across grade levels, and (c) consistency of decisions made across methods of assessment. Results were mixed: the new rubric showed good evidence of reliability and developmental validity, but correlational patterns were unclear. The importance of establishing performance-based assessments of writing that are both technically sound and usable by teachers is discussed.
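
For concreteness, the first two reliability indices named above (percentage agreement and rater-pair correlation) can be computed as in the minimal Python sketch below. The rater scores and variable names are hypothetical illustrations, not data from the study:

    from statistics import correlation  # Pearson r; Python 3.10+

    def percent_agreement(scores_a, scores_b, tolerance=0):
        # Proportion of papers on which two raters' scores agree.
        # tolerance=0 gives exact agreement; tolerance=1 gives
        # adjacent agreement (scores within one scale point).
        hits = sum(abs(a - b) <= tolerance
                   for a, b in zip(scores_a, scores_b))
        return hits / len(scores_a)

    # Hypothetical holistic scores on a 1-6 scale from two raters
    rater_1 = [3, 4, 4, 2, 5, 3, 4, 1, 5, 3]
    rater_2 = [3, 4, 3, 2, 5, 4, 4, 2, 5, 3]

    print(percent_agreement(rater_1, rater_2))     # exact:    0.70
    print(percent_agreement(rater_1, rater_2, 1))  # adjacent: 1.00
    print(correlation(rater_1, rater_2))           # r ≈ 0.89

The third method, a generalizability study, goes further by partitioning score variance into components attributable to students, raters, and tasks, and is typically carried out with dedicated variance-components software.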