Training Recurrent Networks by Evolino
- 1 March 2007
- journal article
- Published by MIT Press in Neural Computation
- Vol. 19 (3) , 757-779
- https://doi.org/10.1162/neco.2007.19.3.757
Abstract
In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases, we present a novel method: EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent support vector machines. We show that Evolino-based LSTM can solve tasks that Echo State nets (Jaeger, 2004a) cannot and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.Keywords
This publication has 18 references indexed in Scilit:
- Framewise phoneme classification with bidirectional LSTM and other neural network architecturesNeural Networks, 2005
- Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless CommunicationScience, 2004
- LSTM recurrent networks learn simple context-free and context-sensitive languagesIEEE Transactions on Neural Networks, 2001
- Learning to Forget: Continual Prediction with LSTMNeural Computation, 2000
- Long Short-Term MemoryNeural Computation, 1997
- Efficient reinforcement learning through symbiotic evolutionMachine Learning, 1996
- Gradient calculations for dynamic recurrent neural networks: a surveyIEEE Transactions on Neural Networks, 1995
- Evolving Mobile Robots in Simulated and Real EnvironmentsArtificial Life, 1995
- Oscillation and Chaos in Physiological Control SystemsScience, 1977
- A generalized inverse for matricesMathematical Proceedings of the Cambridge Philosophical Society, 1955