Simulation with learning agents

1 February 2001

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of the IEEE

Vol. 89 (2), 148-157
https://doi.org/10.1109/5.910851

Abstract

We propose that learning agents (LAs) be incorporated into simulation environments in order to model the adaptive behavior of humans. These LAs adapt to specific circumstances and events during the simulation run. They would select tasks to be accomplished from a given set of tasks as the simulation progresses, or synthesize tasks for themselves based on their observations of the environment and on information they may receive from other agents. We investigate an approach in which agents are assigned goals when the simulation starts and then pursue these goals autonomously and adaptively. During the simulation, agents progressively improve their ability to accomplish their goals effectively and safely. Agents learn from their own observations and from the experience of other agents with whom they exchange information. Each LA starts with a given representation of the simulation environment from which it progressively constructs its own internal representation and uses it to make decisions. The paper describes how learning neural networks can support this approach and shows that goal-based learning can be used effectively in this context. An example simulation is presented in which agents represent manned vehicles; they are assigned the goal of traversing a dangerous metropolitan grid safely and rapidly. The agents use goal-based reinforcement learning with neural networks, and this approach is compared to three other algorithms.
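The idea of an agent that is given a goal at the start of a run and improves its route through a dangerous grid can be illustrated with a minimal sketch. This is not the paper's method: the abstract describes goal-based reinforcement learning with neural networks, whereas the sketch below swaps in plain tabular Q-learning for brevity. The grid size, the positions of the dangerous cells, the reward values, and the names `train_grid_agent` and `greedy_path` are all hypothetical choices made for illustration.

```python
import random

# Hypothetical dangerous intersections on a 5x5 grid (not from the paper).
DANGER = frozenset({(1, 1), (2, 3), (3, 1)})
MOVES = [(0, 1), (1, 0), (0, -1), (-1, 0)]  # right, down, left, up


def train_grid_agent(size=5, danger=DANGER, episodes=2000,
                     alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Q-learning stand-in for the paper's neural-network learner."""
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    q = {}  # (state, action index) -> estimated return

    def step(state, a):
        # Apply the move, clamping to the grid so the agent cannot leave it.
        r = min(max(state[0] + MOVES[a][0], 0), size - 1)
        c = min(max(state[1] + MOVES[a][1], 0), size - 1)
        nxt = (r, c)
        if nxt == goal:
            return nxt, 10.0, True    # goal reached
        if nxt in danger:
            return nxt, -10.0, True   # vehicle lost: episode ends
        return nxt, -1.0, False       # each step costs time

    for _ in range(episodes):
        state = (0, 0)
        for _ in range(4 * size * size):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(4)      # explore
            else:                         # exploit current estimates
                a = max(range(4), key=lambda i: q.get((state, i), 0.0))
            nxt, reward, done = step(state, a)
            best_next = 0.0 if done else max(
                q.get((nxt, i), 0.0) for i in range(4))
            old = q.get((state, a), 0.0)
            q[(state, a)] = old + alpha * (reward + gamma * best_next - old)
            state = nxt
            if done:
                break
    return q, step, goal


def greedy_path(q, step, start=(0, 0), max_steps=100):
    """Follow the learned greedy policy from start and return visited cells."""
    path, state = [start], start
    for _ in range(max_steps):
        a = max(range(4), key=lambda i: q.get((state, i), 0.0))
        state, _, done = step(state, a)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    q, step, goal = train_grid_agent()
    print(greedy_path(q, step))
```

The reward design mirrors the two objectives named in the abstract: the per-step cost stands for "rapidly" and the large penalty on dangerous cells stands for "safely". Exchange of experience between agents, which the abstract also describes, is not modeled here.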
