An Exploration of the Robustness of Four Test Equating Models
- 1 September 1986
- journal article
- research article
- Published by SAGE Publications in Applied Psychological Measurement
- Vol. 10 (3) , 303-317
- https://doi.org/10.1177/014662168601000308
Abstract
This Monte Carlo study explored how four commonly used test equating methods (linear, equipercentile, and item response theory methods based on the Rasch and three-parameter models) responded to tests of different psychometric properties. The four methods were applied to generated data sets where mean item difficulty and discrimination as well as level of chance scoring were manipulated. In all cases, examinee ability was matched to the level of difficulty of the tests. The results showed the Rasch model not to be very robust to violations of the equal discrimination and non-chance scoring assumptions. There were also problems with the three-parameter model, but these were due primarily to estimation and linking problems. The recommended procedure for tests similar to those studied is the equipercentile method.
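To make the terms in the abstract concrete, the following is a minimal sketch, not the authors' procedure. It generates number-correct scores under a three-parameter logistic model (discrimination a, difficulty b, chance level c; setting a = 1 and c = 0 gives the Rasch form) and then applies simplified linear and equipercentile equating. All function names, item parameters, and sample sizes are illustrative assumptions, and the equipercentile step uses plain linear interpolation between integer scores with no smoothing.

```python
import math
import random
import statistics


def p_correct(theta, a, b, c):
    """3PL probability of a correct answer; a=1 and c=0 reduce it to the Rasch form."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def simulate_scores(thetas, items, rng):
    """Number-correct score per examinee on a form given as (a, b, c) tuples."""
    return [
        sum(rng.random() < p_correct(theta, a, b, c) for a, b, c in items)
        for theta in thetas
    ]


def linear_equate(x, scores_x, scores_y):
    """Map score x on form X to form Y by matching means and standard deviations."""
    mean_x, mean_y = statistics.mean(scores_x), statistics.mean(scores_y)
    sd_x, sd_y = statistics.pstdev(scores_x), statistics.pstdev(scores_y)
    return mean_y + (sd_y / sd_x) * (x - mean_x)


def percentile_rank(x, scores):
    """Proportion of scores below x plus half the proportion exactly at x."""
    below = sum(s < x for s in scores)
    at = sum(s == x for s in scores)
    return (below + 0.5 * at) / len(scores)


def equipercentile_equate(x, scores_x, scores_y):
    """Form Y score whose percentile rank matches that of x on form X."""
    target = percentile_rank(x, scores_x)
    top = max(scores_y)
    for y in range(top + 1):
        hi = percentile_rank(y, scores_y)
        if hi >= target:
            if y == 0:
                return 0.0
            # The loop did not stop at y - 1, so lo < target <= hi here.
            lo = percentile_rank(y - 1, scores_y)
            return (y - 1) + (target - lo) / (hi - lo)
    return float(top)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    thetas = [rng.gauss(0.0, 1.0) for _ in range(2000)]
    # Hypothetical forms: X is Rasch-like, Y has unequal discrimination and guessing.
    form_x = [(1.0, b / 10.0, 0.0) for b in range(-10, 11)]
    form_y = [(rng.uniform(0.5, 1.5), b / 10.0, 0.2) for b in range(-10, 11)]
    scores_x = simulate_scores(thetas, form_x, rng)
    scores_y = simulate_scores(thetas, form_y, rng)
    for x in (5, 10, 15):
        print(x,
              round(linear_equate(x, scores_x, scores_y), 2),
              round(equipercentile_equate(x, scores_x, scores_y), 2))
```

Running the script prints, for a few form X scores, the form Y equivalents under each method. Comparing the two columns as the assumed discrimination spread or chance level on form Y changes gives a rough feel for the kind of manipulation the abstract describes.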
This publication has 14 references indexed in Scilit:
- IRT versus Conventional Equating Methods: A Comparative Study of Scale Stability. Journal of Educational Statistics, 1983
- Recovery of Two- and Three-Parameter Logistic Item Characteristic Curves: A Monte Carlo Study. Applied Psychological Measurement, 1982
- Unidimensionality and Vertical Equating with the Rasch Model. Journal of Educational Measurement, 1982
- Comparison of a Rasch Model Scale and the Grade-Equivalent Scale for Vertical Equating of Test Scores. Applied Psychological Measurement, 1981
- Some Empirical Results Related to the Robustness of the Rasch Model. Applied Psychological Measurement, 1981
- Comparison of Traditional and Item Response Theory Methods for Equating Tests. Journal of Educational Measurement, 1981
- Vertical Equating Using the Rasch Model. Journal of Educational Measurement, 1980
- A Note on Vertical Equating via the Rasch Model for Groups of Quite Different Ability and Tests of Quite Different Difficulty. Journal of Educational Measurement, 1979
- An Exploration of the Adequacy of the Rasch Model for the Problem of Vertical Equating. Journal of Educational Measurement, 1978
- The National Reference Scale for Reading: An Application of the Rasch Model. Journal of Educational Measurement, 1977