Storage capacity and learning algorithms for two-layer neural networks
- 1 May 1992
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review A
- Vol. 45 (10) , 7590-7609
- https://doi.org/10.1103/physreva.45.7590
Abstract
A two-layer feedforward network of McCulloch-Pitts neurons with N inputs and K hidden units is analyzed for N→∞ and K finite with respect to its ability to implement p=αN random input-output relations. Special emphasis is put on the case where all hidden units are coupled to the output with the same strength (committee machine) and the receptive fields of the hidden units either enclose all input units (fully connected) or are nonoverlapping (tree structure). The storage capacity is determined generalizing Gardner’s treatment [J. Phys. A 21, 257 (1988); Europhys. Lett. 4, 481 (1987)] of the single-layer perceptron. For the treelike architecture, a replica-symmetric calculation yields ∝ √K for a large number K of hidden units. This result violates an upper bound derived by Mitchison and Durbin [Biol. Cybern. 60, 345 (1989)]. One-step replica-symmetry breaking gives lower values of .
Keywords
This publication has 23 references indexed in Scilit:
- Statistical mechanics of neural networks near saturationPublished by Elsevier ,2004
- Tilinglike learning in the parity machinePhysical Review A, 1991
- Storage Capacity of a Multilayer Neural Network with Binary WeightsEurophysics Letters, 1991
- Statistical mechanics of a multilayered neural networkPhysical Review Letters, 1990
- Learning in feedforward layered networks: the tiling algorithmJournal of Physics A: General Physics, 1989
- Bounds on the learning capacity of some multi-layer networksBiological Cybernetics, 1989
- On the capabilities of multilayer perceptronsJournal of Complexity, 1988
- The space of interactions in neural network modelsJournal of Physics A: General Physics, 1988
- Maximum Storage Capacity in Neural NetworksEurophysics Letters, 1987
- Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern RecognitionIEEE Transactions on Electronic Computers, 1965