Comparing Human and Automatic Face Recognition Performance
- 24 September 2007
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
- Vol. 37 (5), 1248-1255
- https://doi.org/10.1109/tsmcb.2007.907036
Abstract
Face recognition technologies have seen dramatic improvements in performance over the past decade, and such systems are now widely used for security and commercial applications. Since recognizing faces is a task that humans are understood to be very good at, it is common to want to compare automatic face recognition (AFR) and human face recognition (HFR) in terms of biometric performance. This paper addresses this question by: 1) conducting verification tests on volunteers (HFR) and commercial AFR systems and 2) developing statistical methods to support comparison of the performance of different biometric systems. HFR was tested by presenting face-image pairs and asking subjects to classify them on a scale of "Same," "Probably Same," "Not sure," "Probably Different," and "Different"; the same image pairs were presented to AFR systems, and the biometric match score was measured. To evaluate these results, two new statistical evaluation techniques are developed. The first is a new way to normalize match-score distributions, where a normalized match score is calculated as a function of the angle from a representation of [false match rate, false nonmatch rate] values in polar coordinates from some center. Using this normalization, we develop a second methodology to calculate an average detection error tradeoff (DET) curve and show that this method is equivalent to direct averaging of DET data along each angle from the center. This procedure is then applied to compare the performance of the best AFR algorithms available to us in the years 1999, 2001, 2003, 2005, and 2006, in comparison to human scores. Results show that algorithms have dramatically improved in performance over that time. In comparison to the performance of the best AFR system of 2006, 29.2% of human subjects performed better, while 37.5% performed worse.
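The angle-based DET averaging described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract only says "some center", so the center at (FMR, FNMR) = (1, 1) is an assumption, as are the function names `to_polar` and `average_det`, the angular grid, and the use of linear interpolation of radius over angle (an approximation to intersecting each ray with the curve).

```python
import numpy as np

def to_polar(fmr, fnmr, center=(1.0, 1.0)):
    """Express DET points as (angle, radius) about an assumed center."""
    dx = center[0] - np.asarray(fmr, dtype=float)
    dy = center[1] - np.asarray(fnmr, dtype=float)
    theta = np.arctan2(dy, dx)      # angle of each point as seen from the center
    radius = np.hypot(dx, dy)       # distance of each point from the center
    order = np.argsort(theta)       # np.interp needs increasing x values
    return theta[order], radius[order]

def average_det(curves, n_angles=91, center=(1.0, 1.0)):
    """Average several DET curves radially, one angle at a time.

    curves: iterable of (fmr, fnmr) array pairs, one pair per system or subject.
    Returns the angular grid and the averaged (fmr, fnmr) coordinates.
    """
    angles = np.linspace(0.0, np.pi / 2, n_angles)
    radii = []
    for fmr, fnmr in curves:
        theta, radius = to_polar(fmr, fnmr, center)
        # Approximate each curve's radius at the common angles.
        radii.append(np.interp(angles, theta, radius))
    mean_radius = np.mean(radii, axis=0)
    # Map the averaged radius back to (FMR, FNMR) coordinates.
    avg_fmr = center[0] - mean_radius * np.cos(angles)
    avg_fnmr = center[1] - mean_radius * np.sin(angles)
    return angles, avg_fmr, avg_fnmr
```

Under these assumptions, each human subject's five-level ratings or each AFR system's match scores would first be converted to (FMR, FNMR) points by sweeping a decision threshold; the resulting curves could then be passed to `average_det` as a list of array pairs.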