Agreement Coefficients as Indices of Dependability for Domain-Referenced Tests

1 January 1980

journal article
research article
Published by SAGE Publications in Applied Psychological Measurement

Vol. 4 (1) , 105-126
https://doi.org/10.1177/014662168000400111

Abstract

A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance and one that is not. The special cases of these two indices are determined by assumptions about the nature of the agreement function or, equivalently, the nature of the loss function for the testing procedure. For example, indices discussed by Huynh (1976), Subkoviak (1976), and Swaminathan, Hambleton, and Algina (1974) employ a threshold agreement, or loss, function, whereas indices discussed by Brennan and Kane (1977a, 1977b) and Livingston (1972a) employ a squared-error loss function. Since all of these indices are discussed within a single general framework, the differences among them in their assumptions, properties, and uses can be exhibited clearly. For purposes of comparison, norm-referenced generalizability coefficients are also developed and discussed within this general framework.
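
As a rough illustration of the threshold case only, the sketch below assumes two parallel administrations of a mastery test scored against a single cutoff. Under that assumption, the uncorrected index reduces to the proportion of examinees classified consistently, and the chance-corrected index reduces to Cohen's kappa. The function name, the scores, and the cutoff are hypothetical; this is not the paper's notation or its general formulation.

```python
from typing import Sequence, Tuple


def threshold_agreement(
    scores_a: Sequence[float],
    scores_b: Sequence[float],
    cutoff: float,
) -> Tuple[float, float]:
    """Return (proportion of agreement, Cohen's kappa) for mastery decisions.

    Assumes scores_a and scores_b are scores for the same examinees on two
    parallel test forms, and that a score >= cutoff means "master".
    """
    if len(scores_a) != len(scores_b) or len(scores_a) == 0:
        raise ValueError("score lists must be non-empty and of equal length")

    n = len(scores_a)
    masters_a = [s >= cutoff for s in scores_a]
    masters_b = [s >= cutoff for s in scores_b]

    # Index not corrected for chance: share of consistent classifications.
    p_observed = sum(a == b for a, b in zip(masters_a, masters_b)) / n

    # Agreement expected by chance, from the marginal mastery rates.
    rate_a = sum(masters_a) / n
    rate_b = sum(masters_b) / n
    p_chance = rate_a * rate_b + (1 - rate_a) * (1 - rate_b)

    # Index corrected for chance (Cohen's kappa); undefined when p_chance is 1.
    if p_chance == 1.0:
        kappa = float("nan")
    else:
        kappa = (p_observed - p_chance) / (1 - p_chance)

    return p_observed, kappa


if __name__ == "__main__":
    # Hypothetical scores for ten examinees on two 10-item forms.
    form_a = [8, 6, 9, 3, 7, 5, 9, 4, 8, 2]
    form_b = [7, 7, 9, 4, 5, 6, 8, 3, 9, 3]
    p0, kappa = threshold_agreement(form_a, form_b, cutoff=7)
    print(f"proportion of agreement = {p0:.2f}, kappa = {kappa:.2f}")
```

A threshold function of this kind counts only whether the two mastery decisions match, so a near miss and a large miss are treated alike. A squared-error loss function would instead weight each discrepancy by its distance from the cutoff.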

This publication has 23 references indexed in Scilit:

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments
Review of Educational Research, 1978
Signal/Noise Ratios for Domain-Referenced Tests
Psychometrika, 1977
An Index of Dependability for Mastery Tests
Journal of Educational Measurement, 1977
On the Reliability of Decisions in Domain-Referenced Testing
Journal of Educational Measurement, 1976
Toward an Integration of Theory and Method for Criterion-Referenced Tests
Journal of Educational Measurement, 1973
Criterion-Referenced Applications of Classical Test Theory
Journal of Educational Measurement, 1972
A Reply to Harris "An Interpretation of Livingston's Reliability Coefficient for Criterion-Referenced Tests"
Journal of Educational Measurement, 1972
Instructional Technology and the Measurement of Learning Outcomes: Some Questions
American Psychologist, 1963
A Coefficient of Agreement for Nominal Scales
Educational and Psychological Measurement, 1960
Coefficient Alpha and the Internal Structure of Tests
Psychometrika, 1951