Factors influencing testing time requirements for measurements using written simulations

1 January 1989

journal article
research article
Published by Taylor & Francis in Teaching and Learning in Medicine

Vol. 1 (2), 85-91
https://doi.org/10.1080/10401338909539387

Abstract

Review of the literature indicates that a major impediment to using written simulations is the large number of cases required to achieve an acceptable level of reproducibility or reliability. This article describes some of the factors affecting the reproducibility of simulation scores (and thus test length requirements) and identifies their impact. It concentrates on four factors affecting the reproducibility of simulations that assess a single skill: (a) score interpretation, (b) skill characteristics, (c) examinee characteristics, and (d) the scaling of scores. With few exceptions, score interpretation, the characteristics of the skill, and the characteristics of the examinees are not under the test developer's control. Once the purpose of measurement is fixed, so are most of these factors. On the other hand, it is often possible to focus cases without trivializing them or hurting the representativeness of the examination. It is also possible to apply item response theory to simulations and take advantage of the strong assumptions of the models to reduce test length. These developments merit the most attention in the future because they hold the promise of reducing test length and allowing wider use of simulations.
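
As a rough illustration of why low per-case reproducibility translates into long tests, the sketch below uses the classical Spearman-Brown prophecy formula. This is an assumption made for illustration only: the abstract does not say which model the article uses, the formula treats cases as parallel measures of a single skill, and the single-case reliability of 0.10 and the target of 0.80 are hypothetical values, not results from the article.

```python
import math


def spearman_brown(single_case_reliability: float, n_cases: int) -> float:
    """Projected reliability of a test built from n_cases parallel cases."""
    r = single_case_reliability
    return n_cases * r / (1 + (n_cases - 1) * r)


def cases_needed(single_case_reliability: float, target: float) -> int:
    """Smallest number of parallel cases whose projected reliability reaches target."""
    r = single_case_reliability
    n = target * (1 - r) / (r * (1 - target))
    # Subtract a tiny tolerance so floating-point noise does not add a spurious case.
    return math.ceil(n - 1e-9)


if __name__ == "__main__":
    r_single = 0.10   # hypothetical reliability of a single case
    goal = 0.80       # hypothetical target reliability for the whole test
    n = cases_needed(r_single, goal)
    print(n, round(spearman_brown(r_single, n), 3))
```

Under these assumptions, a higher single-case reliability (for example, from more focused cases) shrinks the required number of cases quickly, which is the kind of lever the abstract points to.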
