Classification of linearly nonseparable patterns by linear threshold elements

1 March 1995

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks

Vol. 6 (2) , 318-331
https://doi.org/10.1109/72.363468

Abstract

Learning and convergence properties of linear threshold elements or perceptrons are well understood for the case where the input vectors (or the training sets) to the perceptron are linearly separable. Little is known, however, about the behavior of the perceptron learning algorithm when the training sets are linearly nonseparable. We present the first known results on the structure of linearly nonseparable training sets and on the behavior of perceptrons when the set of input vectors is linearly nonseparable. More precisely, we show that using the well known perceptron learning algorithm, a linear threshold element can learn the input vectors that are provably learnable, and identify those vectors that cannot be learned without committing errors. We also show how a linear threshold element can be used to learn large linearly separable subsets of any given nonseparable training set. In order to develop our results, we first establish formal characterizations of linearly nonseparable training sets and define learnable structures for such patterns. We also prove computational complexity results for the related learning problems. Next, based on such characterizations, we show that a perceptron does the best one can expect for linearly nonseparable sets of input vectors and learns as much as is theoretically possible.

Keywords

This publication has 7 references indexed in Scilit:

Training a 3-node neural network is NP-complete
Neural Networks, 1992
Adaptive Ho-Kashyap rules for perceptron training
IEEE Transactions on Neural Networks, 1992
30 years of adaptive neural networks: perceptron, Madaline, and backpropagation
Proceedings of the IEEE, 1990
The complexity of information extraction
IEEE Transactions on Information Theory, 1986
The complexity of analog computation
Mathematics and Computers in Simulation, 1986
Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition
IEEE Transactions on Electronic Computers, 1965
A logical calculus of the ideas immanent in nervous activity
Bulletin of Mathematical Biology, 1943