Generalizability of Stratified-Parallel Tests

1 March 1965

journal article
Published by Cambridge University Press (CUP) in Psychometrika

Vol. 30 (1) , 39-56
https://doi.org/10.1007/bf02289746

Abstract

One of the major concerns of reliability theory has been the estimation of the reliability of a composite measure from the degree of agreement among its component parts. In the classical theory, formulas were developed under the assumption that the parts are strictly equivalent. It was later shown that the same formulas follow from various sets of weaker assumptions which require the composites to be strictly equivalent and require the parts to have a certain homogeneity of statistical properties, but not necessarily to be equivalent. An alternative model which has received increasing attention in recent years regards a given measure as a random sample from a universe of measures whose homogeneity or equivalence is not specified a priori, and a composite test as a random sample of items from a universe of not-necessarily-equivalent items. This too permits an internal-consistency estimate of reliability. Both the equivalent-composites model and the randomsampling model appear to be unduly restrictive and unrealistic; we propose here to develop the implications of a third model in which a test is considered to have been formed by stratified sampling of items.

Keywords

This publication has 15 references indexed in Scilit:

The Signal/Noise Ratio in the Comparison of Reliability Coefficients
Educational and Psychological Measurement, 1964
THEORY OF GENERALIZABILITY: A LIBERALIZATION OF RELIABILITY THEORY†
British Journal of Statistical Psychology, 1963
Internal-Consistency Reliability Formulas Applied to Randomly Sampled Single-Factor Tests: an Empirical Comparison
Educational and Psychological Measurement, 1962
An Approach to Mental Test Theory
Psychometrika, 1959
The Kuder-Richardson formula (21) as a Split-Half Coefficient, and Some Remarks on its Basic Assumption
Psychometrika, 1958
Average Values of Mean Squares in Factorials
The Annals of Mathematical Statistics, 1956
Sampling Error due to Choice of Split in Split-Half Reliability Coefficients
The Journal of Experimental Education, 1956
Estimating Test Reliability
Educational and Psychological Measurement, 1955
Estimation of the Reliability of Ratings
Psychometrika, 1951
Coefficient alpha and the internal structure of tests
Psychometrika, 1951