Input feature selection for classification problems
- 7 August 2002
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 13 (1), 143-159
- https://doi.org/10.1109/72.977291
Abstract
Feature selection plays an important role in classifying systems such as neural networks (NNs). A dataset's attributes may be relevant, irrelevant, or redundant, and because a dataset can be huge, it is desirable to reduce the number of attributes by selecting only the relevant ones. Doing so is expected to yield higher performance with lower computational effort. In this paper, we propose two feature selection algorithms. The limitation of the mutual information feature selector (MIFS) is analyzed, and a method to overcome this limitation is studied. One of the proposed algorithms makes more considered use of the mutual information between input attributes and output classes than MIFS does. We demonstrate that the proposed method can match the performance of the ideal greedy selection algorithm when information is distributed uniformly. The computational load of this algorithm is nearly the same as that of MIFS. In addition, another feature selection algorithm using the Taguchi method is proposed. It addresses the question of how to identify good features with as few experiments as possible. The proposed algorithms are applied to several classification problems and compared with MIFS. The two algorithms can be combined to complement each other's limitations. The combined algorithm performed well in several experiments and should prove to be a useful method for selecting features in classification problems.
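For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of the greedy MIFS criterion as it is commonly stated: at each step, pick the feature f that maximizes I(C; f) - beta * sum of I(f; s) over already selected features s. This is not the improved algorithm proposed in the paper, whose details the abstract does not give. The function names, the histogram-based estimator, the restriction to discrete features, and the default `beta` are all assumptions made for illustration.

```python
import numpy as np

def mutual_information(x, y):
    """Estimate I(X;Y) in bits for two discrete 1-D arrays from joint counts."""
    x_vals, x_idx = np.unique(np.ravel(x), return_inverse=True)
    y_vals, y_idx = np.unique(np.ravel(y), return_inverse=True)
    joint = np.zeros((len(x_vals), len(y_vals)))
    np.add.at(joint, (x_idx, y_idx), 1)       # joint histogram of (x, y)
    joint /= joint.sum()                      # counts -> probabilities
    px = joint.sum(axis=1, keepdims=True)     # marginal of x, shape (nx, 1)
    py = joint.sum(axis=0, keepdims=True)     # marginal of y, shape (1, ny)
    nz = joint > 0                            # skip empty cells (0 * log 0 = 0)
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (px @ py)[nz])))

def mifs(X, c, k, beta=0.5):
    """Greedy MIFS-style selection of k column indices of X (illustrative sketch).

    X    : (n_samples, n_features) array of discrete feature values
    c    : (n_samples,) array of class labels
    beta : weight of the redundancy penalty (assumed default, not from the paper)
    """
    n_features = X.shape[1]
    relevance = [mutual_information(X[:, j], c) for j in range(n_features)]
    selected, remaining = [], list(range(n_features))
    while remaining and len(selected) < k:
        def score(j):
            # Relevance to the class minus penalized redundancy with chosen features.
            redundancy = sum(mutual_information(X[:, j], X[:, s]) for s in selected)
            return relevance[j] - beta * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data: the class is determined by two independent bits a and b.
a = np.tile([0, 0, 1, 1], 25)
b = np.tile([0, 1, 0, 1], 25)
c = 2 * a + b
X = np.column_stack([a, a, b])   # column 1 duplicates column 0 (redundant)
chosen = mifs(X, c, k=2, beta=0.5)
```

In this toy case all three columns are equally relevant to the class on their own, so the redundancy penalty is what steers the second pick away from the duplicated column. With continuous features, the histogram estimator would need a binning step, which is omitted here.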