Generalizability of Stratified-Parallel Tests
- 1 March 1965
- journal article
- Published by Cambridge University Press (CUP) in Psychometrika
- Vol. 30 (1) , 39-56
- https://doi.org/10.1007/bf02289746
Abstract
One of the major concerns of reliability theory has been the estimation of the reliability of a composite measure from the degree of agreement among its component parts. In the classical theory, formulas were developed under the assumption that the parts are strictly equivalent. It was later shown that the same formulas follow from various sets of weaker assumptions which require the composites to be strictly equivalent and require the parts to have a certain homogeneity of statistical properties, but not necessarily to be equivalent. An alternative model which has received increasing attention in recent years regards a given measure as a random sample from a universe of measures whose homogeneity or equivalence is not specified a priori, and a composite test as a random sample of items from a universe of not-necessarily-equivalent items. This too permits an internal-consistency estimate of reliability. Both the equivalent-composites model and the randomsampling model appear to be unduly restrictive and unrealistic; we propose here to develop the implications of a third model in which a test is considered to have been formed by stratified sampling of items.Keywords
This publication has 15 references indexed in Scilit:
- The Signal/Noise Ratio in the Comparison of Reliability CoefficientsEducational and Psychological Measurement, 1964
- THEORY OF GENERALIZABILITY: A LIBERALIZATION OF RELIABILITY THEORY†British Journal of Statistical Psychology, 1963
- Internal-Consistency Reliability Formulas Applied to Randomly Sampled Single-Factor Tests: an Empirical ComparisonEducational and Psychological Measurement, 1962
- An Approach to Mental Test TheoryPsychometrika, 1959
- The Kuder-Richardson formula (21) as a Split-Half Coefficient, and Some Remarks on its Basic AssumptionPsychometrika, 1958
- Average Values of Mean Squares in FactorialsThe Annals of Mathematical Statistics, 1956
- Sampling Error due to Choice of Split in Split-Half Reliability CoefficientsThe Journal of Experimental Education, 1956
- Estimating Test ReliabilityEducational and Psychological Measurement, 1955
- Estimation of the Reliability of RatingsPsychometrika, 1951
- Coefficient alpha and the internal structure of testsPsychometrika, 1951