Abstract
A priori estimates of item characteristics are necessary for the efficient development of sound tests. Judges were asked to rate the Format, Relevancy, Difficulty, Discrimination, and Overall Quality of multiple-choice items for an examination covering health science information. Ratings of Relevancy were least reliable. The combined estimates of Difficulty did not correspond with empirical values of item difficulty; however, the combined ratings for Discrimination did correlate (point-biserial) significantly (p < .05) with item-total test scores. Additionally, combined ratings of Overall Quality were correlated significantly (p < .05) with item-total correlations.
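A minimal sketch (not the authors' code) of the kind of comparison the abstract describes: empirical item difficulty and item-total point-biserial discrimination are computed from dichotomous responses, then correlated with judges' combined a priori ratings. All data, variable names, and sample sizes below are hypothetical.

```python
# Hypothetical illustration of comparing judges' a priori item ratings
# with empirical item statistics; all data here are simulated.
import numpy as np
from scipy.stats import pointbiserialr, pearsonr

rng = np.random.default_rng(0)
n_examinees, n_items = 200, 30

# Simulated dichotomous (0/1) responses to a multiple-choice examination.
responses = (rng.random((n_examinees, n_items))
             < rng.uniform(0.3, 0.9, n_items)).astype(int)

# Empirical item difficulty: proportion of examinees answering correctly.
difficulty = responses.mean(axis=0)

# Empirical item discrimination: point-biserial correlation between each
# dichotomous item score and the total score on the remaining items.
totals = responses.sum(axis=1)
item_total_r = np.array([
    pointbiserialr(responses[:, j], totals - responses[:, j])[0]
    for j in range(n_items)
])

# Hypothetical judges' combined (mean) a priori ratings per item.
rated_difficulty = rng.uniform(1, 5, n_items)
rated_discrimination = rng.uniform(1, 5, n_items)

# Do the combined ratings track the empirical values?
r_diff, p_diff = pearsonr(rated_difficulty, difficulty)
r_disc, p_disc = pearsonr(rated_discrimination, item_total_r)
print(f"Rated vs. empirical difficulty:      r = {r_diff:.2f}, p = {p_diff:.3f}")
print(f"Rated vs. item-total discrimination: r = {r_disc:.2f}, p = {p_disc:.3f}")
```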