A Comparison of Selected Empirical Methods for Assessing the Structure of Responses to Test Items
- 1 May 2003
- journal article
- other
- Published by SAGE Publications in Applied Psychological Measurement
- Vol. 27 (3) , 159-203
- https://doi.org/10.1177/0146621603027003001
Abstract
Selected methods of empirically assessing the structure of tests with dichotomous items were compared. The methods included both exploratory and confirmatory procedures from two different families, those based on parametric models and nonparametric methods based on conditional item covariances. The analysis conditions considered were typical of large-scale assessments, for example, the tests were composed of a relatively large number of items, and it was assumed that a relatively large sample size would be available for analysis. Comparisons of the methods were conducted for real data from a 62-item test of reading ability and for computer-generated data for multiple unidimensional and multidimensional cases. For the most part, all methods performed reasonably well over a relatively wide range of conditions. The several exceptions to this outcome occurred when the test data departed appreciably from the assumptions or inherent limitations associated with a method, for example, when guessing was present but not allowed for in the analysis or when the multidimensional test structure was nonsimple but the goal of the method was to estimate the amount of multidimensional simple structure. Index terms: test structure, test dimensionality, local item dependencies, test factors.Keywords
This publication has 66 references indexed in Scilit:
- The Performance of Dimtest When Latent Trait and Item Difficulty Distributions DifferApplied Psychological Measurement, 2000
- Item‐Bundle DIF Hypothesis Testing: Identifying Suspect Bundles and Assessing Their Differential FunctioningJournal of Educational Measurement, 1996
- Test Theory ReconceivedJournal of Educational Measurement, 1996
- Refinements of Stout's Procedure for Assessing Latent Trait UnidimensionalityJournal of Educational Statistics, 1993
- DIMTEST: A Fortran Program for Assessing Dimensionality of Binary Item ResponsesApplied Psychological Measurement, 1992
- On the Reliability of Testlet‐Based TestsJournal of Educational Measurement, 1991
- Assessing the Dimensionality of NAEP Reading DataJournal of Educational Measurement, 1987
- Recent Developments in the Factor Analysis of Categorical VariablesJournal of Educational Statistics, 1986
- Full-Information Item Factor Analysis: Applications of EAP ScoresApplied Psychological Measurement, 1985
- The Difficulty of Test Items That Measure More Than One AbilityApplied Psychological Measurement, 1985