Classification of linearly nonseparable patterns by linear threshold elements
- 1 March 1995
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 6 (2) , 318-331
- https://doi.org/10.1109/72.363468
Abstract
Learning and convergence properties of linear threshold elements or perceptrons are well understood for the case where the input vectors (or the training sets) to the perceptron are linearly separable. Little is known, however, about the behavior of the perceptron learning algorithm when the training sets are linearly nonseparable. We present the first known results on the structure of linearly nonseparable training sets and on the behavior of perceptrons when the set of input vectors is linearly nonseparable. More precisely, we show that using the well known perceptron learning algorithm, a linear threshold element can learn the input vectors that are provably learnable, and identify those vectors that cannot be learned without committing errors. We also show how a linear threshold element can be used to learn large linearly separable subsets of any given nonseparable training set. In order to develop our results, we first establish formal characterizations of linearly nonseparable training sets and define learnable structures for such patterns. We also prove computational complexity results for the related learning problems. Next, based on such characterizations, we show that a perceptron does the best one can expect for linearly nonseparable sets of input vectors and learns as much as is theoretically possible.Keywords
This publication has 7 references indexed in Scilit:
- Training a 3-node neural network is NP-completeNeural Networks, 1992
- Adaptive Ho-Kashyap rules for perceptron trainingIEEE Transactions on Neural Networks, 1992
- 30 years of adaptive neural networks: perceptron, Madaline, and backpropagationProceedings of the IEEE, 1990
- The complexity of information extractionIEEE Transactions on Information Theory, 1986
- The complexity of analog computationMathematics and Computers in Simulation, 1986
- Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern RecognitionIEEE Transactions on Electronic Computers, 1965
- A logical calculus of the ideas immanent in nervous activityBulletin of Mathematical Biology, 1943