The Effect of Data Sampling on the Performance Evaluation of Artificial Neural Networks in Medical Diagnosis
- 1 April 1997
- journal article
- Published by SAGE Publications in Medical Decision Making
- Vol. 17 (2) , 186-192
- https://doi.org/10.1177/0272989x9701700209
Abstract
Purpose. To study the effect of data sampling on the predictive assessment of artificial neural networks (ANNs) for medical diagnostic tasks. Methods. Three statistical techniques were used to evaluate the diagnostic performances of ANNs: 1) cross validation, 2) round robin, and 3) bootstrap. These techniques are different sampling plans designed to reduce the small-sample estimation bias and variance contributions. The study was based on two networks, one developed for the diagnosis of pulmonary embolism (1,064 cases) and the other developed for the diagnosis of breast cancer (206 cases). Results. The three sampling techniques produced different performance estimates for both networks. The estimates varied substantially depending on the training sample size and the training-stopping criterion. Conclusion. The predictive assessment of ANNs in medical diagnosis can vary substantially based on the complexity of the problem, the data sampling technique, and the number of cases available. Key words: computer-aided diagnosis; artificial neural networks; sampling; cross validation; round robin; bootstrap; receiver operating characteristic analysis. (Med Decis Making 1997;17:186-192)Keywords
This publication has 19 references indexed in Scilit:
- Artificial neural network for diagnosis of acute pulmonary embolism: effect of case and observer selection.Radiology, 1995
- Screening: Assessment of current studiesCancer, 1994
- Acute pulmonary embolism: artificial neural network approach for diagnosis.Radiology, 1993
- Clinical characteristics of patients with acute pulmonary embolismThe American Journal of Cardiology, 1991
- Value of the Ventilation/Perfusion Scan in Acute Pulmonary EmbolismJAMA, 1990
- Chapter IV. Reduced breast-cancer mortality with mammography screening-an assessment of currently available dataInternational Journal of Cancer, 1990
- Learning in Artificial Neural Networks: A Statistical PerspectiveNeural Computation, 1989
- An introduction to computing with neural netsIEEE ASSP Magazine, 1987
- The perceptron: A probabilistic model for information storage and organization in the brain.Psychological Review, 1958
- A logical calculus of the ideas immanent in nervous activityBulletin of Mathematical Biology, 1943