Associative learning in random environments using neural networks
- 1 January 1991
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 2 (1) , 20-31
- https://doi.org/10.1109/72.80288
Abstract
Associative learning is investigated using neural networks and concepts based on learning automata. The behavior of a single decision-maker containing a neural network is studied in a random environment using reinforcement learning. The objective is to determine the optimal action corresponding to a particular state. Since decisions have to be made throughout the context space based on a countable number of experiments, generalization is inevitable. Many different approaches can be followed to generate the desired discriminant function. Three different methods which use neural networks are discussed and compared. In the most general method, the output of the network determines the probability with which one of the actions is to be chosen. The weights of the network are updated on the basis of the actions and the response of the environment. The extension of similar concepts to decentralized decision-making in a context space is also introduced. Simulation results are included. Modifications in the implementations of the most general method to make it practically viable are also presented. All the methods suggested are feasible and the choice of a specific method depends on the accuracy desired as well as on the available computational power.Keywords
This publication has 10 references indexed in Scilit:
- Identification and control of dynamical systems using neural networksIEEE Transactions on Neural Networks, 1990
- ATM communications network control by neural networksIEEE Transactions on Neural Networks, 1990
- Experiments on neural net recognition of spoken and written textIEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
- Learned classification of sonar targets using a massively parallel networkIEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
- Layered neural nets for pattern recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
- Nonstationary models of learning automata routing in data communication networksIEEE Transactions on Systems, Man, and Cybernetics, 1987
- Decentralized learning in finite Markov chainsIEEE Transactions on Automatic Control, 1986
- A cooperative game of a pair of learning automataAutomatica, 1984
- An N-player sequential stochastic game with identical payoffsIEEE Transactions on Systems, Man, and Cybernetics, 1983
- Stochastic Automata Models with Applications to Learning SystemsIEEE Transactions on Systems, Man, and Cybernetics, 1973