Detecting Potentially Biased Test Items: Comparison of IRT Area and Mantel-Haenszel Methods

1 October 1989

journal article
Published by Taylor & Francis in Applied Measurement in Education

Vol. 2 (4) , 313-334
https://doi.org/10.1207/s15324818ame0204_4

Abstract

The purpose of this study was to compare the IRT-based area method and the Mantel-Haenszel method for investigating differential item functioning (DIF), to determine the degree of agreement between the methods in identifying potentially biased items, and, when the two methods led to different results, to identify possible reasons for the discrepancies. Data for the study were the item responses of Anglo American and Native American students who took the 1982 New Mexico High School Proficiency Exam. Two samples of 1,000 students from each group were studied. The major findings were that (a) the consistency of classifications of items into "biased" and "not-biased" categories across replications was 75% to 80% for both methods and (b) when the unreliability of the statistics was taken into account, the two methods led to very similar results. Discrepancies between methods were due to the presence of nonuniform DIF (the Mantel-Haenszel method could not identify these items) and the choice of interval over which DIF was assessed (the IRT method results depended on the choice of interval). The implications for practitioners seem clear: The Mantel-Haenszel method in general provides an acceptable approximation to the IRT-based methods.

Keywords

This publication has 7 references indexed in Scilit:

Evaluation of Computer Simulated Baseline Statistics for Use in Item Bias Studies
Educational and Psychological Measurement, 1989
Item Response Theory
Published by Springer Nature ,1985
Accounting for Statistical Artifacts in Item Bias Research
Journal of Educational Statistics, 1984
EMPIRICAL COMPARISON OF SELECTED ITEM BIAS DETECTION PROCEDURES WITH BIAS MANIPULATION
Journal of Educational Measurement, 1984
Comparison of Procedures for Detecting test-Item Bias with both Internal and External Ability Criteria
Journal of Educational Statistics, 1981
A MONTE CARLO COMPARISON OF SEVEN BIASED ITEM DETECTION TECHNIQUES
Journal of Educational Measurement, 1980
A COMPARISON OF SEVERAL METHODS OF ASSESSING ITEM BIAS
Journal of Educational Measurement, 1979