On the probabilistic interpretation of neural network classifiers and discriminative training criteria

1 January 1995

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 17 (2), 107-119
https://doi.org/10.1109/34.368176

Abstract

This paper presents a probabilistic interpretation of two central issues in neural-network-based classification: the meaning of discriminative training criteria and of the network outputs, and the meaning of the network structure. The problem of finding a suitable network structure can be linked to a number of well-established techniques in statistical pattern recognition, such as the method of potential functions, kernel densities, and continuous mixture densities. Discriminative training of neural network outputs amounts to approximating the class posterior probabilities of the classical statistical approach. The paper extends these links by introducing and analyzing novel criteria such as maximizing the class probability and minimizing the smoothed error rate, both defined in the framework of class-conditional probability density functions. These criteria can be interpreted in terms of weighted maximum likelihood estimation, where the weights depend in a complicated nonlinear fashion on the model parameters to be trained. In particular, this approach covers widely used techniques such as corrective training, learning vector quantization, and linear discriminant analysis.
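
The weighted maximum likelihood reading can be illustrated with a small sketch. It is not taken from the paper: it assumes unit-variance isotropic Gaussian class-conditional densities, takes "maximizing the class probability" to mean maximizing log p(c|x) for the correct class c, and uses hypothetical function names. Under those assumptions the gradient with respect to each class mean equals the ordinary maximum likelihood gradient (x - mean_k) multiplied by a weight delta(k, c) - p(k|x), and that weight depends nonlinearly on all model parameters through the posterior.

```python
import math

def log_joint(x, mean, prior):
    """log[p(k) p(x|k)] for a unit-variance isotropic Gaussian (assumed model)."""
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mean))
    return math.log(prior) - 0.5 * sq_dist - 0.5 * len(x) * math.log(2 * math.pi)

def posteriors(x, means, priors):
    """Bayes' rule p(k|x), computed as a numerically stable softmax of the log joints."""
    scores = [log_joint(x, m, p) for m, p in zip(means, priors)]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def ml_weights(x, label, means, priors):
    """Per-class weights delta(k, label) - p(k|x); they depend on every parameter."""
    post = posteriors(x, means, priors)
    return [(1.0 if k == label else 0.0) - p for k, p in enumerate(post)]

def grad_log_posterior(x, label, means, priors):
    """Gradient of log p(label|x) w.r.t. each class mean:
    weight_k times the maximum likelihood gradient (x - mean_k)."""
    weights = ml_weights(x, label, means, priors)
    return [[w * (xi - mi) for xi, mi in zip(x, m)] for w, m in zip(weights, means)]

if __name__ == "__main__":
    # Illustrative values only, not data from the paper.
    means = [[0.0, 0.0], [2.0, 1.0]]
    priors = [0.3, 0.7]
    x = [0.5, -0.2]
    print("posteriors:", posteriors(x, means, priors))
    print("weights for label 0:", ml_weights(x, 0, means, priors))
    print("gradient for label 0:", grad_log_posterior(x, 0, means, priors))
```

In this sketch the correct class receives a positive weight 1 - p(c|x) and every competing class a negative weight -p(k|x), so a sample that is already classified with high confidence contributes almost nothing to the update.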
