Factors influencing testing time requirements for measurements using written simulations
- 1 January 1989
- journal article
- research article
- Published by Taylor & Francis in Teaching and Learning in Medicine
- Vol. 1 (2) , 85-91
- https://doi.org/10.1080/10401338909539387
Abstract
Review of the literature indicates that a major impediment to using written simulations is the large number of cases required to achieve an acceptable level of reproducibility or reliability. This article describes some of the factors affecting the reproducibility of simulation scores (and thus test length requirements) and identifies their impact. It concentrates on four factors affecting the reproducibility of simulations that assess a single skill: (a) score interpretation, (b) skill characteristics, (c) examinee characteristics, and (d) the scaling of scores. With few exceptions, score interpretation, the characteristics of the skill, and the characteristics of the examinees are not under the test developer's control. Once the purpose of measurement is fixed, so are most of these factors. On the other hand, it is often possible to focus cases without trivializing them or hurting the representativeness of the examination. It is also possible to apply item response theory to simulations and take advantage of the strong assumptions of the models to reduce test length. These developments merit the most attention in the future because they hold the promise of reducing test length and allowing wider use of simulations.Keywords
This publication has 12 references indexed in Scilit:
- Reliability of performance on standardized patient cases: A comparison of consistency measures based on generalizability theoryTeaching and Learning in Medicine, 1989
- A Criterion-Referenced Examination of Physician CompetenceEvaluation & the Health Professions, 1988
- A criterion-referenced examination in cardiovascular diseaseMedical Education, 1988
- The Answer Key as a Source of Error in Examinations for ProfessionalsJournal of Educational Measurement, 1987
- ASSESSMENT OF CLINICAL COMPETENCE: WRITTEN AND COMPUTER‐BASED SIMULATIONSAssessment & Evaluation in Higher Education, 1987
- An Evaluation of a Computer Simulation in the Assessment of Physician CompetenceEvaluation & the Health Professions, 1986
- Reliability, validity and efficiency of multiple choice question and patient management problem item formats in assessment of clinical competenceMedical Education, 1985
- The validity of licensure examinations.American Psychologist, 1982
- Agreement Coefficients as Indices of Dependability for Domain-Referenced TestsApplied Psychological Measurement, 1980
- AN INDEX OF DEPENDABILITY FOR MASTERY TESTSJournal of Educational Measurement, 1977