A Comparison of Selected Empirical Methods for Assessing the Structure of Responses to Test Items

1 May 2003

journal article
Published by SAGE Publications in Applied Psychological Measurement

Vol. 27 (3), 159-203
https://doi.org/10.1177/0146621603027003001

Abstract

Selected methods for empirically assessing the structure of tests composed of dichotomous items were compared. The methods included both exploratory and confirmatory procedures from two families: methods based on parametric models and nonparametric methods based on conditional item covariances. The analysis conditions were typical of large-scale assessments: the tests were composed of a relatively large number of items, and a relatively large sample was assumed to be available for analysis. The methods were compared on real data from a 62-item test of reading ability and on computer-generated data for multiple unidimensional and multidimensional cases. For the most part, all methods performed reasonably well over a relatively wide range of conditions. The exceptions occurred when the test data departed appreciably from the assumptions or inherent limitations of a method, for example, when guessing was present but not allowed for in the analysis, or when the multidimensional test structure was nonsimple but the goal of the method was to estimate the amount of multidimensional simple structure. Index terms: test structure, test dimensionality, local item dependencies, test factors.
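
As a rough illustration of the quantity behind the nonparametric family mentioned in the abstract, the sketch below estimates the covariance of two dichotomous items conditional on the rest score (the number-correct score on all remaining items), averaged over rest-score groups weighted by group size. This is a generic textbook-style computation, not the specific procedure evaluated in the article. The function name `conditional_covariance` and the toy response matrix are assumptions made for this example only.

```python
from collections import defaultdict


def conditional_covariance(responses, i, j):
    """Size-weighted average of cov(X_i, X_j | rest score).

    responses: list of rows, each a list of 0/1 item scores.
    i, j: column indices of the item pair (hypothetical helper, for illustration).
    """
    # Group examinees by their score on all items other than i and j.
    groups = defaultdict(list)
    for row in responses:
        rest_score = sum(row) - row[i] - row[j]
        groups[rest_score].append((row[i], row[j]))

    weighted_sum = 0.0
    total_n = 0
    for pairs in groups.values():
        n = len(pairs)
        if n < 2:
            continue  # a covariance is not informative for a single examinee
        mean_i = sum(a for a, _ in pairs) / n
        mean_j = sum(b for _, b in pairs) / n
        cov = sum((a - mean_i) * (b - mean_j) for a, b in pairs) / n
        weighted_sum += n * cov
        total_n += n

    return weighted_sum / total_n if total_n else 0.0


# Toy data (assumed): items 0 and 1 always agree within each rest-score group.
toy = [
    [1, 1, 0],
    [0, 0, 0],
    [1, 1, 1],
    [0, 0, 1],
]
print(conditional_covariance(toy, 0, 1))  # prints 0.25
```

Under a unidimensional, locally independent structure, conditional covariances of this kind are expected to be close to zero; values that are consistently positive for a pair of items suggest that the pair shares a source of dependence beyond what the conditioning score captures.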
