The Evaluator Effect: A Chilling Fact About Usability Evaluation Methods
- 1 February 2003
- journal article
- Published by Taylor & Francis in International Journal of Human–Computer Interaction
- Vol. 15 (1), 183-204
- https://doi.org/10.1207/s15327590ijhc1501_14
Abstract
Computer professionals have a need for robust, easy-to-use usability evaluation methods (UEMs) to help them systematically improve the usability of computer artifacts. However, cognitive walkthrough (CW), heuristic evaluation (HE), and thinking-aloud study (TA), 3 of the most widely used UEMs, suffer from a substantial evaluator effect in that multiple evaluators evaluating the same interface with the same UEM detect markedly different sets of problems. A review of 11 studies of these 3 UEMs reveals that the evaluator effect exists for both novice and experienced evaluators, for both cosmetic and severe problems, for both problem detection and severity assessment, and for evaluations of both simple and complex systems. The average agreement between any 2 evaluators who have evaluated the same system using the same UEM ranges from 5% to 65%, and no 1 of the 3 UEMs is consistently better than the others. Although evaluator effects of this magnitude may not be surprising for a UEM as informal as HE, it is cer...
This publication has 26 references indexed in Scilit:
- Evaluating Evaluation Methods. Published by Cambridge University Press (CUP), 2010
- Cognitive walkthroughs: a method for theory-based evaluation of user interfaces. Published by Elsevier, 2006
- Effect of Type of Information on Real Time Usability Evaluation: Implications for Remote Usability Testing. Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2000
- The Evaluator Effect in Usability Studies: Problem Detection and Severity Judgments. Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 1998
- Damaged Merchandise? A Review of Experiments That Compare Usability Evaluation Methods. Human–Computer Interaction, 1998
- Commentary on "Damaged Merchandise?" Human–Computer Interaction, 1998
- Interobserver variability in dermatopathology. Archives of Dermatology, 1997
- Cognitive Walkthroughs. Published by Elsevier, 1997
- Usability laboratories. Behaviour & Information Technology, 1994
- Indexing consistency and quality. American Documentation, 1969
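
The abstract reports an average agreement between any 2 evaluators but, as truncated here, does not say how that agreement is computed. As a rough illustration only, the sketch below assumes agreement between two evaluators is the number of problems both detected divided by the number of problems either detected, averaged over all evaluator pairs. The function name `any_two_agreement` and the problem identifiers are hypothetical and not taken from the article.

```python
from itertools import combinations


def any_two_agreement(problem_sets):
    """Average pairwise overlap between evaluators' problem sets.

    Assumption: overlap for a pair is |A & B| / |A | B| (shared problems
    divided by all problems found by either evaluator).
    """
    pairs = list(combinations(problem_sets, 2))
    if not pairs:
        raise ValueError("need at least two evaluators")
    total = 0.0
    for a, b in pairs:
        union = a | b
        # Two evaluators who both found nothing are treated as agreeing fully.
        total += len(a & b) / len(union) if union else 1.0
    return total / len(pairs)


# Hypothetical data: problems detected by three evaluators of one interface.
evaluators = [
    {"P1", "P2", "P3"},
    {"P2", "P3", "P4"},
    {"P3", "P5"},
]
agreement = any_two_agreement(evaluators)
print(f"{agreement:.0%}")
```

With these made-up sets, the three pairs overlap by 2/4, 1/4, and 1/4, so the average is one third, which falls within the 5% to 65% range quoted in the abstract.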