Misclassification among methods used for multiple group discrimination‐the effects of distributional properties
- 1 May 1991
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 10 (5) , 757-766
- https://doi.org/10.1002/sim.4780100511
Abstract
Methods of multiple group discriminant analysis have not been fully studied with respect to classification into more than two populations when the covariate distributions are normal or non‐normal. The present study examines the classification performance of several multiple discrimination methods under a variety of simulated continuous normal and non‐normal covariate distributions. The methods include polychotomous logistic regression, multiple group linear discriminant analysis, kernel density estimation, and rank transformations of the data as input into the linear function. The parameters of interest were distance among populations, configuration of population mean vectors (collinear or forming the vertices of a regular simplex), skewness, kurtosis and bimodality. Simulation of the last three parameters was by log‐normal, sinh−1normal and a two‐component mixture of normal distributions, respectively. Results with three trivariate populations show that for all distributions, logistic discrimination classifies close to the optimal under Neyman‐Pearson allocation. These results suggest that logistic discrimination is preferable to other widely‐used methods for multiple group classification with non‐normal data, and is comparable to classification by multiple linear discrimination with normal data.Keywords
This publication has 46 references indexed in Scilit:
- The Efficiency of Multinomial Logistic Regression Compared with Multiple Group Discriminant AnalysisJournal of the American Statistical Association, 1987
- Frequency Polygons: Theory and ApplicationJournal of the American Statistical Association, 1985
- Oversmoothed Nonparametric Density EstimatesJournal of the American Statistical Association, 1985
- Robustness of Fisher's Linear Discriminant Function under Two-Component Mixed Normal ModelsJournal of the American Statistical Association, 1981
- A Note on Bias Correction in Maximum likelihood Estimation With logistic DiscriminationTechnometrics, 1980
- A new approach for evaluating risk factors in coronary artery disease: a study of lipid concentrations and severity of disease in 1847 males.Circulation, 1980
- Logistic Discrimination and Bias Correction in Maximum Likelihood EstimationTechnometrics, 1979
- The Performance of Fisher's Linear Discriminant Function Under Non-Optimal ConditionsTechnometrics, 1977
- The Efficiency of Logistic Regression Compared to Normal Discriminant AnalysisJournal of the American Statistical Association, 1975
- A Comparison of Some Multivariate Discrimination ProceduresJournal of the American Statistical Association, 1972