Simultaneous feature selection and clustering using mixture models
- 26 July 2004
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 26 (9), 1154-1166
- https://doi.org/10.1109/tpami.2004.71
Abstract
Clustering is a common unsupervised learning technique used to discover group structure in a set of data. While there exist many algorithms for clustering, the important issue of feature selection, that is, what attributes of the data should be used by the clustering algorithms, is rarely touched upon. Feature selection for clustering is difficult because, unlike in supervised learning, there are no class labels for the data and, thus, no obvious criteria to guide the search. Another important problem in clustering is the determination of the number of clusters, which clearly impacts and is influenced by the feature selection issue. In this paper, we propose the concept of feature saliency and introduce an expectation-maximization (EM) algorithm to estimate it, in the context of mixture-based clustering. Due to the introduction of a minimum message length model selection criterion, the saliency of irrelevant features is driven toward zero, which corresponds to performing feature selection. The criterion and algorithm are then extended to simultaneously estimate the feature saliencies and the number of clusters.