Adding Temporary Memory to ZCS

1 September 1994

journal article
research article
Published by SAGE Publications in Adaptive Behavior

Vol. 3 (2), 101-150
https://doi.org/10.1177/105971239400300201

Abstract

In a recent article, Wilson (1994) described a "zeroth-level" classifier system (ZCS). ZCS employs a reinforcement learning technique comparable to Q-learning (Watkins, 1989). This article presents results from the first reconstruction of ZCS. Having replicated Wilson's results, we extend ZCS in a manner suggested by Wilson: The original formulation of ZCS has no memory mechanisms, but Wilson (1994b) suggested how internal "temporary memory" registers could be added. We show results from adding one-bit and two-bit memory registers to ZCS. Our results demonstrate that ZCS can exploit memory facilities efficiently in non-Markov environments. We also show that the memoryless ZCS can converge on near-optimal stochastic solutions in non-Markov environments. We then present results from trials using ZCS in Markov environments that require increasingly long chains of actions before reward is received. Our results indicate that inaccurate overgeneral classifiers can interact with the classifier-generation mechanisms to cause catastrophic breakdowns in overall system performance. Basing classifier fitness on accuracy may alleviate this problem. We conclude that the memory mechanism in its current form is unlikely to scale well for situations requiring large amounts of temporary memory. Nevertheless, the ability to find stochastic solutions when there is insufficient memory might offset this problem somewhat.
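
As a rough illustration of the "temporary memory" idea summarized above, the following minimal sketch shows one way a classifier could match on both sensory input and an internal memory register, and then update that register. It is a sketch under stated assumptions, not the article's implementation: the ternary alphabet {0, 1, #} is standard in classifier systems, but the names `Classifier`, `classifier_matches`, and `apply_mem_action`, the field layout, the placeholder strength value, and the convention that `#` in a memory action leaves a bit unchanged are all hypothetical choices made here for illustration.

```python
from dataclasses import dataclass

WILDCARD = "#"  # "don't care" symbol of the ternary alphabet {0, 1, #}


@dataclass
class Classifier:
    # All field names are illustrative assumptions, not taken from the article.
    ext_condition: str     # matched against the sensory input bits
    mem_condition: str     # matched against the memory register bits
    ext_action: str        # action performed in the environment
    mem_action: str        # per-bit register update; '#' = leave bit as-is (assumed)
    strength: float = 1.0  # placeholder value, not a parameter from the article


def matches(condition: str, bits: str) -> bool:
    """True if every non-wildcard position of the condition equals the input bit."""
    return len(condition) == len(bits) and all(
        c == WILDCARD or c == b for c, b in zip(condition, bits)
    )


def classifier_matches(cl: Classifier, sensors: str, memory: str) -> bool:
    """A classifier fires only if both its external and memory conditions match."""
    return matches(cl.ext_condition, sensors) and matches(cl.mem_condition, memory)


def apply_mem_action(mem_action: str, memory: str) -> str:
    """Return the new register contents: explicit bits overwrite, '#' keeps the old bit."""
    return "".join(m if a == WILDCARD else a for a, m in zip(mem_action, memory))


# Usage: a one-bit register lets two classifiers respond differently
# to the same sensory input, depending on what was stored earlier.
cl = Classifier(ext_condition="1#0", mem_condition="0", ext_action="north", mem_action="1")
sensors, memory = "110", "0"
if classifier_matches(cl, sensors, memory):
    memory = apply_mem_action(cl.mem_action, memory)  # register now holds "1"
```

With a register of this kind, two situations that look identical to the sensors can be told apart by the stored bit, which is the sort of disambiguation a non-Markov environment calls for.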
