A Note on How to Quantify and Report Whether Irt Parameter Invariance Holds: When Pearson Correlations are Not Enough
- 1 August 2004
- journal article
- research article
- Published by SAGE Publications in Educational and Psychological Measurement
- Vol. 64 (4) , 588-599
- https://doi.org/10.1177/0013164403261051
Abstract
Based on seminal work by Lord and Hambleton, Swaminathan, and Rogers, this article is an analytical, graphical, and conceptual reminder that item response theory (IRT) parameter invariance only holds for perfect model fit in multiple populations or across multiple conditions and is thus an ideal state. In practice, one attempts to quantify the degree to which a lack of invariance is likely to be present through repeated calibrations of item and examinee parameters. Motivated by two recent studies on item parameter invariance, this article shows how a seemingly intuitive measure such as Pearson’s Product-Moment Correlation Coefficient (PPMCC) is insufficient for that purpose, as it is not sensitive to restrictive linear relationships such as identities, which are required for parameter invariance to hold. It thus misses, for example, additive group-level effects, which are observed in practice with translated instruments or with large-scale assessments such as TIMSS.Keywords
This publication has 20 references indexed in Scilit:
- Identifying national cultures of mathematics education: Analysis of cognitive demands and differential item functioning in TIMSSEuropean Journal of Psychology of Education, 2001
- Bayesian Estimation of a Multilevel IRT Model Using Gibbs SamplingPsychometrika, 2001
- The Effect of Error in Item Parameter Estimates on the Test Response Function Method of LinkingApplied Psychological Measurement, 2001
- An Evaluation of the Accuracy of Multidimensional IRT LinkingApplied Psychological Measurement, 2000
- Item Response Theory and Classical Test Theory: An Empirical Comparison of their Item/Person StatisticsEducational and Psychological Measurement, 1998
- A Comparison of Procedures to Detect Item Parameter DriftApplied Psychological Measurement, 1998
- Multilevel Item Response Models: An Approach to Errors in Variables RegressionJournal of Educational and Behavioral Statistics, 1997
- Linking Multidimensional Item CalibrationsApplied Psychological Measurement, 1996
- Test EquatingPublished by Springer Nature ,1995
- An NCME Instructional Module onEducational Measurement: Issues and Practice, 1993