Redundancy reduction with information-preserving nonlinear maps

Abstract
The basic idea of linear principal component analysis (PCA) is to decorrelate the coordinates by an orthogonal linear transformation. In this paper we generalize this idea to the nonlinear case, and at the same time drop the usual restriction to Gaussian distributions. The linearity and orthogonality conditions of linear PCA are replaced by the condition of volume conservation, in order to avoid spurious information being generated by the nonlinear transformation; this leads us to a very general class of nonlinear transformations, called symplectic maps. Further, instead of minimizing the correlation, we minimize the redundancy measured at the output coordinates. This generalizes second-order statistics, which are valid only for Gaussian output distributions, to higher-order statistics. The proposed paradigm implements Barlow's redundancy-reduction principle for unsupervised feature extraction. The resulting factorial representation of the joint probability distribution presumably facilitates density estimation and is applied in particular to novelty detection.
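The volume-conservation condition mentioned above requires the nonlinear map to have a Jacobian determinant of magnitude one everywhere, so that the transformation cannot create or destroy differential entropy. A minimal sketch of such a map, assuming an additive triangular coupling (the `tanh` nonlinearity and the function names are illustrative choices, not the architecture of this paper), checks the unit determinant numerically:

```python
import numpy as np

def coupling_map(x, w=2.0):
    # Additive triangular coupling: y1 = x1, y2 = x2 + tanh(w * x1).
    # The Jacobian is lower-triangular with ones on the diagonal,
    # so det J = 1 and the map conserves volume exactly.
    y = x.copy()
    y[1] = x[1] + np.tanh(w * x[0])
    return y

def jacobian_det(f, x, eps=1e-6):
    # Finite-difference estimate of the Jacobian determinant of f at x.
    n = len(x)
    J = np.zeros((n, n))
    fx = f(x)
    for i in range(n):
        xp = x.copy()
        xp[i] += eps
        J[:, i] = (f(xp) - fx) / eps
    return np.linalg.det(J)

x = np.array([0.3, -1.2])
print(jacobian_det(coupling_map, x))  # ≈ 1.0 at any point x
```

Because the determinant is identically one, any redundancy measured at the output is a property of the learned representation rather than an artifact of the coordinate change, which is the point of restricting the search to volume-conserving maps.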