Simulation with learning agents
- 1 February 2001
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of the IEEE
- Vol. 89 (2) , 148-157
- https://doi.org/10.1109/5.910851
Abstract
We propose that learning agents (LAs) be incorporated into simulation environments in order to model the adaptive behavior of humans. These LAs adapt to specific circumstances and events during the simulation run. They would select tasks to be accomplished among a given set of tasks as the simulation progresses, or synthesize tasks for themselves based on their observations of the environment and on information they may receive from other agents. We investigate an approach in which agents are assigned goals when the simulation starts and then pursue these goals autonomously and adaptively. During the simulation, agents progressively improve their ability to accomplish their goals effectively and safely. Agents learn from their own observations and from the experience of other agents with whom they exchange information. Each LA starts with a given representation of the simulation environment from which it progressively constructs its own internal representation and uses it to make decisions. The paper describes how learning neural networks can support this approach and shows that goal based learning may be used effectively used in this context. An example simulation is presented in which agents represent manned vehicles; they are assigned the goal of traversing a dangerous metropolitan grid safely and rapidly using goal based reinforcement learning with neural networks and compared to three other algorithms.Keywords
This publication has 8 references indexed in Scilit:
- Reinforcement learning with internal expectation for the random neural networkEuropean Journal of Operational Research, 2000
- Function approximation with spiked random networksIEEE Transactions on Neural Networks, 1999
- Learning in the Recurrent Random Neural NetworkNeural Computation, 1993
- Learning to predict by the methods of temporal differencesMachine Learning, 1988
- The use of learning algorithms in telephone traffic routing—A methodologyAutomatica, 1983
- Comparison of Expedient and Optima Reinforcement Schemes for Learning SystemsJournal of Cybernetics, 1972
- A Realizable Model for Stochastic Sequential MachinesIEEE Transactions on Computers, 1971
- On probabilistic automata with structural restrictionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1969