Optimized feature extraction and the Bayes decision in feed-forward classifier networks
- 1 April 1991
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 13 (4) , 355-364
- https://doi.org/10.1109/34.88570
Abstract
The problem of multiclass pattern classification using adaptive layered networks is addressed. A special class of networks, i.e., feed-forward networks with a linear final layer, that perform generalized linear discriminant analysis is discussed, This class is sufficiently generic to encompass the behavior of arbitrary feed-forward nonlinear networks. Training the network consists of a least-square approach which combines a generalized inverse computation to solve for the final layer weights, together with a nonlinear optimization scheme to solve for parameters of the nonlinearities. A general analytic form for the feature extraction criterion is derived, and it is interpreted for specific forms of target coding and error weighting. An important aspect of the approach is to exhibit how a priori information regarding nonuniform class membership, uneven distribution between train and test sets, and misclassification costs may be exploited in a regularized manner in the training phase of networks.Keywords
This publication has 17 references indexed in Scilit:
- Exploiting prior knowledge in network optimization: an illustration from medical prognosisNetwork: Computation in Neural Systems, 1990
- The optimised internal representation of multilayer classifier networks performs nonlinear discriminant analysisNeural Networks, 1990
- On hidden nodes for neural netsIEEE Transactions on Circuits and Systems, 1989
- Analysis of hidden units in a layered network trained to classify sonar targetsNeural Networks, 1988
- Induction of decision treesMachine Learning, 1986
- 39 Dimensionality and sample size considerations in pattern recognition practicePublished by Elsevier ,1982
- An Analysis of the Total Least Squares ProblemSIAM Journal on Numerical Analysis, 1980
- LEAST-MEAN-SQUARE APPROACH TO PATTERN CLASSIFICATION**The work reported here is supported in part by U.S. PHS Grant No. 2 PO1 GM 15418-09.Published by Elsevier ,1972
- Least-square methods in abstract pattern recognitionInformation Sciences, 1968
- Calculating the Singular Values and Pseudo-Inverse of a MatrixJournal of the Society for Industrial and Applied Mathematics Series B Numerical Analysis, 1965