Detecting Potentially Biased Test Items: Comparison of IRT Area and Mantel-Haenszel Methods
- 1 October 1989
- journal article
- Published by Taylor & Francis in Applied Measurement in Education
- Vol. 2 (4) , 313-334
- https://doi.org/10.1207/s15324818ame0204_4
Abstract
The purpose of this study was to compare the IRT-based area method and the Mantel-Haenszel method for investigating differential item functioning (DIF), to determine the degree of agreement between the methods in identifying potentially biased items, and, when the two methods led to different results, to identify possible reasons for the discrepancies. Data for the study were the item responses of Anglo American and Native American students who took the 1982 New Mexico High School Proficiency Exam. Two samples of 1,000 students from each group were studied. The major findings were that (a) the consistency of classifications of items into "biased" and "not-biased" categories across replications was 75% to 80% for both methods and (b) when the unreliability of the statistics was taken into account, the two methods led to very similar results. Discrepancies between methods were due to the presence of nonuniform DIF (the Mantel-Haenszel method could not identify these items) and the choice of interval over which DIF was assessed (the IRT method results depended on the choice of interval). The implications for practitioners seem clear: The Mantel-Haenszel method in general provides an acceptable approximation to the IRT-based methods.Keywords
This publication has 7 references indexed in Scilit:
- Evaluation of Computer Simulated Baseline Statistics for Use in Item Bias StudiesEducational and Psychological Measurement, 1989
- Item Response TheoryPublished by Springer Nature ,1985
- Accounting for Statistical Artifacts in Item Bias ResearchJournal of Educational Statistics, 1984
- EMPIRICAL COMPARISON OF SELECTED ITEM BIAS DETECTION PROCEDURES WITH BIAS MANIPULATIONJournal of Educational Measurement, 1984
- Comparison of Procedures for Detecting test-Item Bias with both Internal and External Ability CriteriaJournal of Educational Statistics, 1981
- A MONTE CARLO COMPARISON OF SEVEN BIASED ITEM DETECTION TECHNIQUESJournal of Educational Measurement, 1980
- A COMPARISON OF SEVERAL METHODS OF ASSESSING ITEM BIASJournal of Educational Measurement, 1979