Redundancy reduction with information-preserving nonlinear maps
- 1 February 1995
- journal article
- Published by Taylor & Francis in Network: Computation in Neural Systems
- Vol. 6 (1) , 61-72
- https://doi.org/10.1088/0954-898x/6/1/004
Abstract
The basic idea of linear principal component analysis (PCA) involves decorrelating coordinates by an orthogonal linear transformation. In this paper we generalize this idea to the nonlinear case. Simultaneously we shall drop the usual restriction to Gaussian distributions. The linearity and orthogonality condition of linear PCA is replaced by the condition of volume conservation in order to avoid spurious information generated by the nonlinear transformation. This leads us to another very general class of nonlinear transformations, called symplectic maps. Later, instead of minimizing the correlation, we minimize the redundancy measured at the output coordinates. This generalizes second-order statistics, being only valid for Gaussian output distributions, to higher-order statistics. The proposed paradigm implements Barlow's redundancy-reduction principle for unsupervised feature extraction. The resulting factorial representation of the joint probability distribution presumably facilitates density estimation and is applied in particular to novelty detection.Keywords
This publication has 20 references indexed in Scilit:
- Independent component analysis, A new concept?Signal Processing, 1994
- Blind separation of sources: A nonlinear neural algorithmNeural Networks, 1992
- What Does the Retina Know about Natural Scenes?Neural Computation, 1992
- Blind separation of sources, part II: Problems statementSignal Processing, 1991
- Towards a Theory of Early Visual ProcessingNeural Computation, 1990
- Unsupervised LearningNeural Computation, 1989
- Principal CurvesJournal of the American Statistical Association, 1989
- Multilayer feedforward networks are universal approximatorsNeural Networks, 1989
- Neural networks and principal component analysis: Learning from examples without local minimaNeural Networks, 1989
- Auto-association by multilayer perceptrons and singular value decompositionBiological Cybernetics, 1988