On the probabilistic interpretation of neural network classifiers and discriminative training criteria
- 1 January 1995
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 17 (2) , 107-119
- https://doi.org/10.1109/34.368176
Abstract
A probabilistic interpretation is presented for two important issues in neural network based classification, namely the interpretation of discriminative training criteria and the neural network outputs as well as the interpretation of the structure of the neural network. The problem of finding a suitable structure of the neural network can be linked to a number of well established techniques in statistical pattern recognition, such as the method of potential functions, kernel densities, and continuous mixture densities. Discriminative training of neural network outputs amounts to approximating the class or posterior probabilities of the classical statistical approach. This paper extends these links by introducing and analyzing novel criteria such as maximizing the class probability and minimizing the smoothed error rate. These criteria are defined in the framework of class-conditional probability density functions. We will show that these criteria can be interpreted in terms of weighted maximum likelihood estimation, where the weights depend in a complicated nonlinear fashion on the model parameters to be trained. In particular, this approach covers widely used techniques such as corrective training, learning vector quantization, and linear discriminant analysis.Keywords
This publication has 27 references indexed in Scilit:
- A probabilistic approach to the understanding and training of neural network classifiersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Neural networks, maximum mutual information training, and maximum likelihood training (speech recognition)Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Neural Network Classifiers Estimate Bayesian a posteriori ProbabilitiesNeural Computation, 1991
- Networks for approximation and learningProceedings of the IEEE, 1990
- A new error criterion for posterior probability estimation with neural netsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1990
- Probabilistic neural networksNeural Networks, 1990
- Learning in Artificial Neural Networks: A Statistical PerspectiveNeural Computation, 1989
- Phoneme classification experiments using radial basis functionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989
- Consistent inference of probabilities in layered networks: predictions and generalizationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989
- An Adaptive Pattern Classification SystemIEEE Transactions on Systems Science and Cybernetics, 1966