An Investigation of the Item Parameter Recovery Characteristics of a Gibbs Sampling Procedure
- 1 June 1998
- journal article
- research article
- Published by SAGE Publications in Applied Psychological Measurement
- Vol. 22 (2) , 153-169
- https://doi.org/10.1177/01466216980222005
Abstract
The item parameter recovery characteristics of a Gibb's sampling method (Albert, 1992) for IRT item parameter estimation were investigated using a simulation study. The item parameters were estimated, under a normal ogive item response function model, using Gibbs sampling and BILOG (Mislevy & Bock, 1989). The item parameter estimates were then equated to the metric of the underlying item parameters for tests with 10, 20, 30, and 50 items, and samples of 30, 60, 120, and 500 examinees. Summary statistics of the equating coefficients showed that Gibbs sampling and BILOG both produced trait scale metrics with units of measurement that were too small, but yielding a proper midpoint of the metric. When expressed in a common metric, the biases of the BILOG estimates of the item discriminations were uniformly smaller and less variable than those from Gibbs sampling. The biases of the item difficulty estimates yielded by the two estimation procedures were small and similar to each other. In addition, the item parameter recovery characteristics were comparable for the largest dataset of 50 items and 500 examinees. However, for short tests and sample sizes the item parameter recovery characteristics of BILOG were superior to those of the Gibbs sampling approach.Keywords
This publication has 15 references indexed in Scilit:
- Markov Chain Monte Carlo Convergence Diagnostics: A Comparative ReviewJournal of the American Statistical Association, 1996
- EQUATE 2.0: A Computer Program for the Characteristic Curve Method of IRT EquatingApplied Psychological Measurement, 1993
- Bayesian Estimation of Normal Ogive Item Response Curves Using Gibbs SamplingJournal of Educational Statistics, 1992
- EQUATE: A Computer Program for the Test Characteristic Curve Method of IRT EquatingApplied Psychological Measurement, 1991
- Some Observations on the Metric of PC-BILOG ResultsApplied Psychological Measurement, 1990
- Sampling-Based Approaches to Calculating Marginal DensitiesJournal of the American Statistical Association, 1990
- Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of ImagesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1984
- Developing a Common Metric in Item Response TheoryApplied Psychological Measurement, 1983
- Recovery of Two- and Three-Parameter Logistic Item Characteristic Curves: A Monte Carlo StudyApplied Psychological Measurement, 1982
- Using Simulation Results to Choose a Latent Trait ModelApplied Psychological Measurement, 1981