The Effects of Guessing and Item Dependence on the Reliability and Validity of Recognition Based Cloze Tests
- 1 September 1982
- journal article
- Published by SAGE Publications in Educational and Psychological Measurement
- Vol. 42 (3), 855-867
- https://doi.org/10.1177/001316448204200321
Abstract
Matching cloze and multiple-choice cloze tests of elementary level reading comprehension have demonstrated promising construct and concurrent validity. However, their formats include guessing and, in the case of matching cloze, item dependence effects. This study used a Monte Carlo design to examine how these effects influence test characteristics and student scores. First, a true score test with six levels of difficulty was constructed. Then, two versions of matching cloze and three of multiple-choice cloze results were generated for each level of difficulty of the true score data by applying the cloze assumptions of random guessing and item dependence. Although validity for all five cloze variants was high, multiple-choice cloze had significantly lower reliabilities than did the true score equivalents. Item dependence found in matching cloze had little or no effect on the characteristics of these tests.
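The Monte Carlo design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the ability model in `simulate_true_scores`, the function names, the sample sizes, and the number of choices are all assumptions made for illustration. The sketch generates a "true score" response matrix, overlays random guessing in a multiple-choice format, overlays guessing in a matching format (where unused options are shuffled among unknown blanks, so items are not independent), and compares KR-20 reliabilities.

```python
import random


def kr20(scores):
    """Kuder-Richardson formula 20 for rows of 0/1 item responses."""
    n_people = len(scores)
    k = len(scores[0])
    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n_people
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_people
    if var_total == 0:
        return 0.0  # reliability is undefined without score variance
    pq_sum = 0.0
    for j in range(k):
        p = sum(row[j] for row in scores) / n_people
        pq_sum += p * (1 - p)
    return (k / (k - 1)) * (1 - pq_sum / var_total)


def simulate_true_scores(n_people, n_items, difficulty, rng):
    """Hypothetical true-score generator (an assumption, not from the paper).

    Each examinee has a uniform ability in [0, 1) and knows an item with
    probability ability + 0.5 - difficulty, clipped to [0, 1].
    """
    data = []
    for _ in range(n_people):
        ability = rng.random()
        p_know = min(1.0, max(0.0, ability + 0.5 - difficulty))
        data.append([1 if rng.random() < p_know else 0 for _ in range(n_items)])
    return data


def apply_multiple_choice_guessing(true_scores, n_choices, rng):
    """Unknown items are guessed independently with success chance 1/n_choices."""
    return [
        [1 if known or rng.random() < 1.0 / n_choices else 0 for known in row]
        for row in true_scores
    ]


def apply_matching_guessing(true_scores, rng):
    """Matching format, assuming one option per blank.

    Options for known blanks are used up; the remaining options are shuffled
    among the unknown blanks, so a guess on one item constrains the others.
    """
    observed = []
    for row in true_scores:
        unknown = [j for j, known in enumerate(row) if not known]
        assigned = unknown[:]
        rng.shuffle(assigned)
        out = list(row)
        for blank, option in zip(unknown, assigned):
            if blank == option:  # the shuffled option happens to be the right one
                out[blank] = 1
        observed.append(out)
    return observed


if __name__ == "__main__":
    rng = random.Random(0)
    for difficulty in (0.3, 0.5, 0.7):  # illustrative levels only
        true = simulate_true_scores(500, 20, difficulty, rng)
        mc = apply_multiple_choice_guessing(true, 4, rng)
        match = apply_matching_guessing(true, rng)
        print(difficulty, kr20(true), kr20(mc), kr20(match))
```

Running the script prints KR-20 for the true-score, multiple-choice, and matching versions at each illustrative difficulty level. The specific values depend entirely on the assumed ability model and should not be read as a reproduction of the study's results.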