Reliability of Test Scores and Decisions

Open Access

1 October 1980

journal article
research article
Published by SAGE Publications in Applied Psychological Measurement

Vol. 4 (4) , 517-545
https://doi.org/10.1177/014662168000400406

Abstract

A criterion-referenced test can be viewed as testing either a continuous or a binary variable, and the scores on a test can be used as measurements of the variable or to make decisions (e.g., pass or fail). Recent work on the reliability of criterion-refer enced tests has focused on the use of scores from tests of continuous variables for decision-making purposes. This work can be categorized according to type of loss function—threshold, linear, or quad ratic. It is the loss function that is used either ex plicitly or implicitly to evaluate the goodness of the decisions that are made on the basis of the test scores. The literature in which a threshold loss function is employed can be further subdivided ac cording to whether the goodness of decisions is as sessed as the probability of making an erroneous decision or as a measure of the consistency of deci sions over repeated testing occasions. This review points to the need for simple procedures by which to estimate the probability of decision errors.

Keywords

This publication has 49 references indexed in Scilit:

A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability Theory
Applied Psychological Measurement, 1980
A STUDY OF THE ACCURACY OF SUBKOVIAK'S SINGLE‐ADMINISTRATION ESTIMATE OF THE COEFFICIENT OF AGREEMENT USING TWO TRUE‐SCORE ESTIMATES
Journal of Educational Measurement, 1978
Signal/Noise Ratios for Domain-Referenced Tests
Psychometrika, 1977
AN INDEX OF DEPENDABILITY FOR MASTERY TESTS
Journal of Educational Measurement, 1977
ITEM SAMPLING AND DECISION‐MAKING IN ACHIEVEMENT TESTING
British Journal of Mathematical and Statistical Psychology, 1974
Large sample standard errors of kappa and weighted kappa.
Psychological Bulletin, 1969
MOMENTS OF THE STATISTICS KAPPA AND WEIGHTED KAPPA
British Journal of Mathematical and Statistical Psychology, 1968
Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit.
Psychological Bulletin, 1968
Content Standard Test Scores
Educational and Psychological Measurement, 1962
A Coefficient of Agreement for Nominal Scales
Educational and Psychological Measurement, 1960