A Simulation Study of Reliability and Validity of Multiple-Choice Test Scores Under Six Response-Scoring Modes

1 December 1982

journal article
Published by American Educational Research Association (AERA) in Journal of Educational Statistics

Vol. 7 (4) , 333-351
https://doi.org/10.3102/10769986007004333

Abstract

Responses to a 40-item, four-choice test were simulated for 120 examinees under six response-scoring modes including number-right, corrected-for-guessing and answer-until-correct. Separate score sets were generated to reflect five levels of prevalence of misinformation (belief that an answer is a distractor) and five levels of propensity-to-guess contrary to instructions for modes designed to inhibit guessing. Criteria were simulated using the number-right mode with five levels of misinformation prevalence and four levels of true-score relationship with the predictor. The entire process was repeated with the introduction of normally distributed, random error at the item level. This process yielded 260 sets of five scores (predictor and four criteria), which were examined to determine differential effects on reliability and validity attributable to the response-scoring modes. Modes permitting multiple responses to an item were found to yield genuine increases in internal consistency reliability, which tended to carry over into validity coefficients. However, the validity differences among all the response-scoring modes simulated were small, probably too small to justify the additional cost and complexity of modes other than number-right.

Keywords

This publication has 12 references indexed in Scilit:

Grading Distractor-Identification Tests
Psychometrika, 1981
The Effect of Misinformation, Partial Information, and Guessing on Expected Multiple-Choice Test Item Scores
Applied Psychological Measurement, 1980
Alternative Response and Scoring Methods for Multiple-Choice Items: An Empirical Study of Probabilistic and Ordinal Response Modes
Applied Psychological Measurement, 1978
Random Guessing, Correction for Guessing, and Reliability of Multiple-Choice Test Scores
The Journal of Experimental Education, 1977
FORMULA SCORING AND NUMBER‐RIGHT SCORING¹
Journal of Educational Measurement, 1975
The Correction for Guessing
Review of Educational Research, 1973
Effect of Variation in Probability of Guessing Correctly on Reliability of Multiple-Choice Tests
Educational and Psychological Measurement, 1970
On Scoring Multiple Choice Exams Allowing for Partial Knowledge
The Journal of Experimental Education, 1970
Some Modifications of the Multiple-Choice Item
Educational and Psychological Measurement, 1953
On the Use of Objective Examinations
Educational and Psychological Measurement, 1953