Optimal ensemble averaging of neural networks

1 January 1997

journal article
Published by Taylor & Francis in Network: Computation in Neural Systems

Vol. 8 (3), 283-296
https://doi.org/10.1088/0954-898x_8_3_004

Abstract

Based on an observation about the different effect of ensemble averaging on the bias and variance portions of the prediction error, we discuss training methodologies for ensembles of networks. We demonstrate the effect of variance reduction and present a method of extrapolation to the limit of an infinite ensemble. A significant reduction of variance is obtained by averaging just over initial conditions of the neural networks, without varying architectures or training sets. The minimum of the ensemble prediction error is reached later than that of a single network. In the vicinity of the minimum, the ensemble prediction error appears to be flatter than that of the single network, thus simplifying optimal stopping decision. The results are demonstrated on sunspots data, where the predictions are among the best obtained, and on the 1993 energy prediction competition data set B.

† School of Physics and Astronomy. ury@tarazan.tau.ac.il.
‡ School of Mathematical Sciences. nin@math.tau.ac.il.
§ School of Physics and ...
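
The variance reduction and the extrapolation to an infinite ensemble described in the abstract can be illustrated with a small numerical sketch. It is not the authors' procedure, which the abstract does not spell out. It assumes that every ensemble member shares the same systematic error (bias) and adds its own independent scatter, here a stand-in for the effect of different initial conditions, so that the error of a Q-member average behaves roughly like e_inf + c / Q. All names (ensemble_mse, extrapolate_to_infinite_ensemble, noise_std) and the synthetic data are hypothetical.

```python
import numpy as np


def ensemble_mse(preds, target, q):
    """Mean squared error of the average of the first q members' predictions."""
    return float(np.mean((preds[:q].mean(axis=0) - target) ** 2))


def extrapolate_to_infinite_ensemble(qs, errors):
    """Fit error(Q) = e_inf + c / Q by least squares; return (e_inf, c)."""
    inv_q = 1.0 / np.asarray(qs, dtype=float)
    slope, intercept = np.polyfit(inv_q, errors, 1)
    return float(intercept), float(slope)


# Synthetic stand-in for trained networks (an assumption, not real data):
# each member predicts target + shared bias + member-specific noise.
rng = np.random.default_rng(0)
n_points, n_members = 2000, 64
x = np.linspace(0.0, 1.0, n_points)
target = np.sin(2 * np.pi * x)
bias = 0.1 * x       # systematic error common to all members
noise_std = 0.3      # scatter between members
preds = target + bias + noise_std * rng.standard_normal((n_members, n_points))

qs = [1, 2, 4, 8, 16, 32, 64]
errors = [ensemble_mse(preds, target, q) for q in qs]

# Intercept at 1/Q -> 0 estimates the error of an infinite ensemble.
e_inf, c = extrapolate_to_infinite_ensemble(qs, errors)
bias_sq = float(np.mean(bias ** 2))

for q, err in zip(qs, errors):
    print(f"Q={q:3d}  ensemble MSE={err:.4f}")
print(f"extrapolated infinite-ensemble MSE: {e_inf:.4f}")
print(f"squared bias of the toy model:      {bias_sq:.4f}")
```

In this toy setting the extrapolated intercept approaches the squared bias, which averaging cannot remove, while the slope approximates the single-member variance. With real networks the members are not fully independent, so the 1/Q form is only an approximation to be checked against the measured errors.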

This publication has 9 references indexed in Scilit:

Finding the Embedding Dimension and Variable Dependencies in Time Series
Neural Computation, 1994
Simplifying Neural Networks by Soft Weight-Sharing
Neural Computation, 1992
Neural Networks and the Bias/Variance Dilemma
Neural Computation, 1992
Stacked generalization
Neural Networks, 1992
Finding Structure in Time
Cognitive Science, 1990
Predicting the Future: A Connectionist Approach
International Journal of Neural Systems, 1990
Learning the hidden structure of speech
The Journal of the Acoustical Society of America, 1988
Threshold Models in Non-linear Time Series Analysis
Published by Springer Nature, 1983
Forecasting the Sunspot Cycle
Journal of the Royal Statistical Society. Series A (General), 1977