Comparison of Equipercentile and Item Response Theory Equating When the Scaling Test Method Is Applied to a Multilevel Achievement Battery
- 1 June 1983
- journal article
- Published by SAGE Publications in Applied Psychological Measurement
- Vol. 7 (3), 267-281
- https://doi.org/10.1177/014662168300700303
Abstract
Test publishers generally choose an anchor or scaling test approach to the development of a growth scale for a multilevel achievement battery. Although some studies have been conducted comparing traditional equipercentile equating procedures with item response theory models using the anchor test (overlapping items) approach, to date there is no evidence on the comparability of equating procedures when the scaling test approach is used. The purpose of this study was to compare the equipercentile, Rasch, one-parameter modified logistic, and two-parameter logistic item response theory procedures in the equating of a multilevel achievement test battery using the scaling test approach. Since the equipercentile method has been widely used by test publishers, it was chosen as a standard for comparison of the experimental results. Individual item pseudo-guessing parameters were specified for the one-parameter modified logistic and two-parameter logistic item response theory models based on the proportion of students in the national standardization sample selecting the least attractive distractor for the item. Two grades (fourth and eighth) and two subtests (reading and mathematics) were selected for analysis. The results of the study suggest that for a small-sample situation in which the scaling test approach has been applied to a multilevel achievement battery, the one-parameter modified and two-parameter item response theory methods (as modified in this study) appear to be viable alternatives to the equipercentile procedure.
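The abstract describes logistic item response models whose pseudo-guessing parameter is fixed in advance for each item rather than estimated. The following is a minimal Python sketch of that idea, not the authors' implementation. The function names are hypothetical, and the rule that the pseudo-guessing value equals the raw proportion choosing the least attractive distractor is an assumption, since the abstract says only that the value was "based on" that proportion. The sketch also omits any scaling constant the study may have used.

```python
import math


def pseudo_guessing(least_attractive_count, n_examinees):
    """Assumed rule (not confirmed by the abstract): use the proportion of
    examinees choosing the least attractive distractor as the fixed
    lower asymptote c for the item."""
    return least_attractive_count / n_examinees


def p_correct(theta, b, a=1.0, c=0.0):
    """Logistic item response function with a fixed lower asymptote c.

    theta: examinee ability; b: item difficulty; a: item discrimination.
    a = 1, c = 0      -> Rasch model
    a = 1, c > 0      -> one-parameter logistic with fixed pseudo-guessing
    a varies, c > 0   -> two-parameter logistic with fixed pseudo-guessing
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: 50 of 1000 examinees chose the least attractive distractor.
c_item = pseudo_guessing(50, 1000)
prob = p_correct(theta=0.0, b=0.0, a=1.0, c=c_item)
```

Under these assumptions, an examinee whose ability equals the item difficulty has a success probability halfway between the fixed asymptote and 1, instead of the 0.5 given by the Rasch model.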