A Comparison of the Rasch Separate Calibration and between-Fit Methods of Detecting Item Bias
- 1 June 1996
- journal article
- research article
- Published by SAGE Publications in Educational and Psychological Measurement
- Vol. 56 (3) , 403-418
- https://doi.org/10.1177/0013164496056003003
Abstract
The objective of this study is to compare two methods of detecting item bias within the framework of Rasch measurement. To accomplish this objective, it was first necessary to arrive at a clear understanding of the definition of bias as commonly used with Rasch measurement models. The comparison between the two methods was based on the Type I error rates in data that contain no bias and the power of the statistics to detect item bias when bias is present. The variables manipulated in this study included sample size, magnitude of bias, number of biased items present on the tests, and mean differences in the ability of the reference and focal groups. The two methods compared were the separate calibration t-test approach proposed by Wright and Stone in 1979 and the common calibration between-fit approach proposed by Wright, Mead, and Draba in 1976.The results indicate that the arbitrary use of bias levels such as +2 can result in the misidentification of biased items.Keywords
This publication has 5 references indexed in Scilit:
- A Comparison of the Power of Rasch Total and Between-Item Fit Statistics to Detect Measurement DisturbancesEducational and Psychological Measurement, 1994
- Gender differences in item performance and predictive validity on the DAT Quantitative Reasoning TestJournal of Dental Education, 1989
- Rasch Models for MeasurementPublished by SAGE Publications ,1988
- A MONTE CARLO COMPARISON OF SEVEN BIASED ITEM DETECTION TECHNIQUESJournal of Educational Measurement, 1980
- A Procedure for Sample-Free Item AnalysisEducational and Psychological Measurement, 1969