Optimal ensemble averaging of neural networks
- 1 January 1997
- journal article
- Published by Taylor & Francis in Network: Computation in Neural Systems
- Vol. 8 (3) , 283-296
- https://doi.org/10.1088/0954-898x_8_3_004
Abstract
Based on an observation about the different effect of ensemble averaging on the bias and variance portions of the prediction error, we discuss training methodologies for ensembles of networks. We demonstrate the effect of variance reduction and present a method of extrapolation to the limit of an infinite ensemble. A significant reduction of variance is obtained by averaging just over initial conditions of the neural networks, without varying architectures or training sets. The minimum of the ensemble prediction error is reached later than that of a single network. In the vicinity of the minimum, the ensemble prediction error appears to be flatter than that of the single network, thus simplifying optimal stopping decision. The results are demonstrated on sunspots data, where the predictions are among the best obtained, and on the 1993 energy prediction competition data set B.†School of Physics and Astronomy. ury@tarazan.tau.ac.il.‡School of Mathematical Sciences. nin@math.tau.ac.il.§School of Physics and ...Keywords
This publication has 9 references indexed in Scilit:
- Finding the Embedding Dimension and Variable Dependencies in Time SeriesNeural Computation, 1994
- Simplifying Neural Networks by Soft Weight-SharingNeural Computation, 1992
- Neural Networks and the Bias/Variance DilemmaNeural Computation, 1992
- Stacked generalizationNeural Networks, 1992
- Finding Structure in TimeCognitive Science, 1990
- PREDICTING THE FUTURE: A CONNECTIONIST APPROACHInternational Journal of Neural Systems, 1990
- Learning the hidden structure of speechThe Journal of the Acoustical Society of America, 1988
- Threshold Models in Non-linear Time Series AnalysisPublished by Springer Nature ,1983
- Forecasting the Sunspot CycleJournal of the Royal Statistical Society. Series A (General), 1977