Examining Rater Errors in the Assessment of Written Composition With a Many‐Faceted Rasch Model
- 1 June 1994
- journal article
- Published by Wiley in Journal of Educational Measurement
- Vol. 31 (2) , 93-112
- https://doi.org/10.1111/j.1745-3984.1994.tb00436.x
Abstract
This study describes several categories of rater errors (rater severity, halo effect, central tendency, and restriction of range). Criteria are presented for evaluating the quality of ratings based on a many‐faceted Rasch measurement (FACETS) model for analyzing judgments. A random sample of 264 compositions rated by 15 raters and a validity committee from the 1990 administration of the Eighth Grade Writing Test in Georgia is used to illustrate the model. The data suggest that there are significant differences in rater severity. Evidence of a halo effect is found for two raters who appear to be rating the compositions holistically rather than analytically. Approximately 80% of the ratings are in the two middle categories of the rating scale, indicating that the error of central tendency is present. Restriction of range is evident when the unadjusted raw score distribution is examined, although this rater error is less evident when adjusted estimates of writing competence are used.
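For orientation, a many-faceted Rasch model for ratings is commonly written in the log-odds form sketched below. The abstract does not give the equation, so the notation and the choice of facets (student, rater, domain, rating category) are assumptions made here for illustration and may differ from the article's own formulation.

```latex
% Assumed notation (not taken from the abstract):
%   P_{nijk}   : probability that student n receives category k from rater i on domain j
%   \theta_n   : writing competence of student n
%   \lambda_i  : severity of rater i
%   \delta_j   : difficulty of domain j
%   \tau_k     : difficulty of category k relative to category k-1
\ln\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right) = \theta_n - \lambda_i - \delta_j - \tau_k
```

In this form, a more severe rater (larger \lambda_i) lowers the log-odds of a composition receiving the higher of two adjacent categories. Because severity enters as its own term, estimates of writing competence can be adjusted for differences between raters.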