Data mining methods in the prediction of Dementia: A real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests
Top Cited Papers
Open Access
- 17 August 2011
- journal article
- Published by Springer Nature in BMC Research Notes
- Vol. 4 (1) , 299
- https://doi.org/10.1186/1756-0500-4-299
Abstract
Dementia and cognitive impairment associated with aging are a major medical and social concern. Neuropsychological testing is a key element in the diagnostic procedures of Mild Cognitive Impairment (MCI), but has presently a limited value in the prediction of progression to dementia. We advance the hypothesis that newer statistical classification methods derived from data mining and machine learning methods like Neural Networks, Support Vector Machines and Random Forests can improve accuracy, sensitivity and specificity of predictions obtained from neuropsychological testing. Seven non parametric classifiers derived from data mining methods (Multilayer Perceptrons Neural Networks, Radial Basis Function Neural Networks, Support Vector Machines, CART, CHAID and QUEST Classification Trees and Random Forests) were compared to three traditional classifiers (Linear Discriminant Analysis, Quadratic Discriminant Analysis and Logistic Regression) in terms of overall classification accuracy, specificity, sensitivity, Area under the ROC curve and Press'Q. Model predictors were 10 neuropsychological tests currently used in the diagnosis of dementia. Statistical distributions of classification parameters obtained from a 5-fold cross-validation were compared using the Friedman's nonparametric test.Keywords
This publication has 55 references indexed in Scilit:
- Feature Selection and Performance Evaluation of Support Vector Machine (SVM)-Based Classifier for Differentiating Benign and Malignant Pulmonary Nodules by Computed TomographyJournal of Digital Imaging, 2009
- Amnestic syndrome of the medial temporal type identifies prodromal ADNeurology, 2007
- Assessment of the performances of multilayer perceptron neural networks in comparison with recurrent neural networks and two statistical methods for diagnosing coronary artery diseaseExpert Systems, 2007
- Global prevalence of dementia: a Delphi consensus studyPublished by Elsevier ,2006
- Jährliche Konversionsrate von Patienten mit Gedächtnisbeeinträchtigung zur Alzheimerkrankheit: Der Einfluss von amnestischer MCI und die prädiktive Aussagekraft der neuropsychologischen TestungWiener klinische Wochenschrift, 2005
- The support vector machine under testNeurocomputing, 2003
- Comparing Linear Discriminant Function With Logistic Regression for the Two-Group Classification ProblemThe Journal of Experimental Education, 1999
- Small sample size effects in statistical pattern recognition: recommendations for practitionersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Effects of sample size in classifier designPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989
- The Efficiency of Logistic Regression Compared to Normal Discriminant AnalysisJournal of the American Statistical Association, 1975