Reliability of Test Scores and Decisions
Open Access
- 1 October 1980
- journal article
- research article
- Published by SAGE Publications in Applied Psychological Measurement
- Vol. 4 (4) , 517-545
- https://doi.org/10.1177/014662168000400406
Abstract
A criterion-referenced test can be viewed as testing either a continuous or a binary variable, and the scores on a test can be used as measurements of the variable or to make decisions (e.g., pass or fail). Recent work on the reliability of criterion-refer enced tests has focused on the use of scores from tests of continuous variables for decision-making purposes. This work can be categorized according to type of loss function—threshold, linear, or quad ratic. It is the loss function that is used either ex plicitly or implicitly to evaluate the goodness of the decisions that are made on the basis of the test scores. The literature in which a threshold loss function is employed can be further subdivided ac cording to whether the goodness of decisions is as sessed as the probability of making an erroneous decision or as a measure of the consistency of deci sions over repeated testing occasions. This review points to the need for simple procedures by which to estimate the probability of decision errors.Keywords
This publication has 49 references indexed in Scilit:
- A Comparison of the Nedelsky and Angoff Cutting Score Procedures Using Generalizability TheoryApplied Psychological Measurement, 1980
- A STUDY OF THE ACCURACY OF SUBKOVIAK'S SINGLE‐ADMINISTRATION ESTIMATE OF THE COEFFICIENT OF AGREEMENT USING TWO TRUE‐SCORE ESTIMATESJournal of Educational Measurement, 1978
- Signal/Noise Ratios for Domain-Referenced TestsPsychometrika, 1977
- AN INDEX OF DEPENDABILITY FOR MASTERY TESTSJournal of Educational Measurement, 1977
- ITEM SAMPLING AND DECISION‐MAKING IN ACHIEVEMENT TESTINGBritish Journal of Mathematical and Statistical Psychology, 1974
- Large sample standard errors of kappa and weighted kappa.Psychological Bulletin, 1969
- MOMENTS OF THE STATISTICS KAPPA AND WEIGHTED KAPPABritish Journal of Mathematical and Statistical Psychology, 1968
- Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit.Psychological Bulletin, 1968
- Content Standard Test ScoresEducational and Psychological Measurement, 1962
- A Coefficient of Agreement for Nominal ScalesEducational and Psychological Measurement, 1960