Adding Temporary Memory to ZCS
- 1 September 1994
- journal article
- research article
- Published by SAGE Publications in Adaptive Behavior
- Vol. 3 (2) , 101-150
- https://doi.org/10.1177/105971239400300201
Abstract
In a recent article, Wilson (1994) described a "zeroth-level" classifier system (ZCS). ZCS employs a reinforcement learning technique comparable to Q-learning (Watkins, 1989). This article presents results from the first reconstruction of ZCS. Having replicated Wilson's results, we extend ZCS in a manner suggested by Wilson: The original formulation of ZCS has no memory mechanisms, but Wilson (1994b) suggested how internal "temporary memory" registers could be added. We show results from adding one-bit and two-bit memory registers to ZCS. Our results demonstrate that ZCS can exploit memory facilities efficiently in non-Markov environments. We also show that the memoryless ZCS can converge on near-optimal stochastic solutions in non-Markov environments. We then present results from trials using ZCS in Markov environments that require increasingly long chains of actions before reward is received. Our results indicate that inaccurate overgeneral classifiers can interact with the classifier-generation mechanisms to cause catastrophic breakdowns in overall system performance. Basing classifier fitness on accuracy may alleviate this problem. We conclude that the memory mechanism in its current form is unlikely to scale well for situations requiring large amounts of temporary memory. Nevertheless, the ability to find stochastic solutions when there is insufficient memory might offset this problem somewhat.Keywords
This publication has 15 references indexed in Scilit:
- Training Agents to Perform Sequential BehaviorAdaptive Behavior, 1994
- Genetic and Non-Genetic Operators in ALECSYSEvolutionary Computation, 1993
- Adding "Foveal Vision" to Wilson's AnimatAdaptive Behavior, 1993
- Automatic programming of behavior-based robots using reinforcement learningArtificial Intelligence, 1992
- Letter recognition using Holland-style adaptive classifiersMachine Learning, 1991
- Learning sequential decision rules using simulation models and competitionMachine Learning, 1990
- Learning and bucket brigade dynamics in classifier systemsPhysica D: Nonlinear Phenomena, 1990
- Landmark maps for honeybeesBiological Cybernetics, 1987
- Landmark learning in beesJournal of Comparative Physiology A, 1983
- AdaptationPublished by Elsevier ,1976