Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy
Top Cited Papers
- 20 June 2005
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 27 (8) , 1226-1238
- https://doi.org/10.1109/tpami.2005.159
Abstract
Feature selection is an important problem for pattern classification systems. We study how to select good features according to the maximal statistical dependency criterion based on mutual information. Because of the difficulty in directly implementing the maximal dependency condition, we first derive an equivalent form, called minimal-redundancy-maximal-relevance criterion (mRMR), for first-order incremental feature selection. Then, we present a two-stage feature selection algorithm by combining mRMR and other more sophisticated feature selectors (e.g., wrappers). This allows us to select a compact set of superior features at very low cost. We perform extensive experimental comparison of our algorithm and other methods using three different classifiers (naive Bayes, support vector machine, and linear discriminate analysis) and four different data sets (handwritten digits, arrhythmia, NCI cancer cell lines, and lymphoma tissues). The results confirm that mRMR leads to promising improvement on feature selection and classification accuracy.Keywords
This publication has 19 references indexed in Scilit:
- Feature selection for multiclass discrimination via mixed-integer linear programmingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Input feature selection by mutual information based on Parzen windowPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A comparison of methods for multiclass support vector machinesIEEE Transactions on Neural Networks, 2002
- A gene expression database for the molecular pharmacology of cancerNature Genetics, 2000
- Systematic variation in gene expression patterns in human cancer cell linesNature Genetics, 2000
- Distinct types of diffuse large B-cell lymphoma identified by gene expression profilingNature, 2000
- Statistical pattern recognition: a reviewPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- Floating search methods in feature selectionPattern Recognition Letters, 1994
- On Estimation of a Probability Density Function and ModeThe Annals of Mathematical Statistics, 1962