Conditional distribution learning with neural networks and its application to channel equalization

Abstract
We present a conditional distribution learning formulation for real-time signal processing with neural networks, based on a recent extension of maximum likelihood theory, partial likelihood (PL) estimation, which allows for i) dependent observations and ii) sequential processing. For a general neural network conditional distribution model, we establish a fundamental information-theoretic connection: the equivalence of maximum PL estimation and accumulated relative entropy (ARE) minimization, and we obtain large-sample properties of PL for the general case of dependent observations. As an example, the binary case with the sigmoidal perceptron as the probability model is presented. It is shown that the single-layer and multilayer perceptron (MLP) models satisfy the conditions for the equivalence of the two cost functions: ARE and negative log partial likelihood. The practical issue of their gradient descent minimization is then studied within the framework of well-formed cost functions. It is shown that these are well-formed cost functions for networks without hidden units; hence, on such networks, their gradient descent minimization is guaranteed to converge to a solution if one exists. The formulation is applied to adaptive channel equalization, and simulation results demonstrate the ability of the least relative entropy equalizer to realize complex decision boundaries and to recover during training from convergence at the wrong extreme in cases where the mean square error-based MLP equalizer cannot.
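To make the binary case concrete, the following is a minimal sketch, not the paper's exact experimental setup: a single sigmoidal perceptron (no hidden units) trained by gradient descent on the negative log likelihood (cross-entropy) cost, used as a binary equalizer for a hypothetical two-tap linear channel. The channel coefficients, noise level, window length, and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dispersive channel: y[n] = s[n] + 0.5*s[n-1] + noise
symbols = rng.choice([-1.0, 1.0], size=2000)
received = symbols + 0.5 * np.concatenate(([0.0], symbols[:-1]))
received += 0.1 * rng.standard_normal(symbols.size)

# Equalizer input: sliding window of two received samples
X = np.column_stack([received, np.concatenate(([0.0], received[:-1]))])
t = (symbols + 1) / 2  # binary targets in {0, 1}

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Batch gradient descent on the negative log-likelihood cost,
# which is well formed for a network without hidden units.
w, b, lr = np.zeros(2), 0.0, 0.1
for _ in range(200):
    p = sigmoid(X @ w + b)          # model's conditional probability P(s[n]=1 | window)
    w -= lr * (X.T @ (p - t)) / t.size
    b -= lr * np.mean(p - t)

# Hard decisions from the learned conditional distribution
decisions = np.where(sigmoid(X @ w + b) > 0.5, 1.0, -1.0)
accuracy = np.mean(decisions == symbols)
print(f"symbol accuracy: {accuracy:.3f}")
```

The perceptron here outputs an estimate of the conditional probability of the transmitted symbol given the received window; thresholding that probability at 0.5 yields the equalizer's symbol decision.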
