A Comparison of the Rasch Separate Calibration and between-Fit Methods of Detecting Item Bias

1 June 1996

journal article
research article
Published by SAGE Publications in Educational and Psychological Measurement

Vol. 56 (3) , 403-418
https://doi.org/10.1177/0013164496056003003

Abstract

The objective of this study is to compare two methods of detecting item bias within the framework of Rasch measurement. To accomplish this objective, it was first necessary to arrive at a clear understanding of the definition of bias as commonly used with Rasch measurement models. The comparison between the two methods was based on the Type I error rates in data that contain no bias and the power of the statistics to detect item bias when bias is present. The variables manipulated in this study included sample size, magnitude of bias, number of biased items present on the tests, and mean differences in the ability of the reference and focal groups. The two methods compared were the separate calibration t-test approach proposed by Wright and Stone in 1979 and the common calibration between-fit approach proposed by Wright, Mead, and Draba in 1976.The results indicate that the arbitrary use of bias levels such as +2 can result in the misidentification of biased items.

Keywords

This publication has 5 references indexed in Scilit:

A Comparison of the Power of Rasch Total and Between-Item Fit Statistics to Detect Measurement Disturbances
Educational and Psychological Measurement, 1994
Gender differences in item performance and predictive validity on the DAT Quantitative Reasoning Test
Journal of Dental Education, 1989
Rasch Models for Measurement
Published by SAGE Publications ,1988
A MONTE CARLO COMPARISON OF SEVEN BIASED ITEM DETECTION TECHNIQUES
Journal of Educational Measurement, 1980
A Procedure for Sample-Free Item Analysis
Educational and Psychological Measurement, 1969