Associative learning in random environments using neural networks

1 January 1991

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks

Vol. 2 (1), 20-31
https://doi.org/10.1109/72.80288

Abstract

Associative learning is investigated using neural networks and concepts based on learning automata. The behavior of a single decision-maker containing a neural network is studied in a random environment using reinforcement learning. The objective is to determine the optimal action corresponding to a particular state. Since decisions have to be made throughout the context space based on a countable number of experiments, generalization is inevitable. Many different approaches can be followed to generate the desired discriminant function. Three different methods which use neural networks are discussed and compared. In the most general method, the output of the network determines the probability with which one of the actions is to be chosen. The weights of the network are updated on the basis of the actions and the response of the environment. The extension of similar concepts to decentralized decision-making in a context space is also introduced. Simulation results are included. Modifications in the implementations of the most general method to make it practically viable are also presented. All the methods suggested are feasible and the choice of a specific method depends on the accuracy desired as well as on the available computational power.
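The abstract's "most general method", in which the network output is the probability of choosing an action and the weights are updated from the action taken and the environment's response, can be illustrated with a small sketch. This is not the authors' algorithm: it assumes a single logistic unit in place of a multilayer network, a REINFORCE-style update, and a hypothetical two-action environment whose optimal action depends on the sign of a scalar context. All names and parameter values (`environment_response`, `train`, the 0.9/0.1 reward probabilities, the learning rate) are illustrative assumptions.

```python
import math
import random


def action_probability(w, b, x):
    """Probability of choosing action 1 in context x (single logistic unit)."""
    return 1.0 / (1.0 + math.exp(-(w * x + b)))


def environment_response(rng, x, action):
    """Hypothetical random environment: returns reward 1 or 0.

    Assumption: action 1 is optimal when x > 0, action 0 otherwise.
    The optimal action is rewarded with probability 0.9, the other with 0.1.
    """
    optimal = 1 if x > 0 else 0
    p_reward = 0.9 if action == optimal else 0.1
    return 1 if rng.random() < p_reward else 0


def train(steps=5000, lr=0.2, seed=0):
    """Learn the context-to-action mapping from stochastic rewards only."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    for _ in range(steps):
        x = rng.uniform(-1.0, 1.0)            # context (state) presented
        p = action_probability(w, b, x)       # network output = P(action 1)
        action = 1 if rng.random() < p else 0  # sample the action
        r = environment_response(rng, x, action)
        # REINFORCE-style update (an assumption, not the paper's rule):
        # move the weights along r * d log P(action) / d weights.
        w += lr * r * (action - p) * x
        b += lr * r * (action - p)
    return w, b


if __name__ == "__main__":
    w, b = train()
    for x in (-0.8, -0.2, 0.2, 0.8):
        print(f"x={x:+.1f}  P(action 1)={action_probability(w, b, x):.3f}")
```

Under these assumptions the learned probability of action 1 should rise above 0.5 for positive contexts and fall below 0.5 for negative ones, so the unit generalizes from a finite number of trials to contexts it has not seen.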

This publication has 10 references indexed in Scilit:

Identification and control of dynamical systems using neural networks
IEEE Transactions on Neural Networks, 1990
ATM communications network control by neural networks
IEEE Transactions on Neural Networks, 1990
Experiments on neural net recognition of spoken and written text
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
Learned classification of sonar targets using a massively parallel network
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
Layered neural nets for pattern recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
Nonstationary models of learning automata routing in data communication networks
IEEE Transactions on Systems, Man, and Cybernetics, 1987
Decentralized learning in finite Markov chains
IEEE Transactions on Automatic Control, 1986
A cooperative game of a pair of learning automata
Automatica, 1984
An N-player sequential stochastic game with identical payoffs
IEEE Transactions on Systems, Man, and Cybernetics, 1983
Stochastic Automata Models with Applications to Learning Systems
IEEE Transactions on Systems, Man, and Cybernetics, 1973