Operant Conditioning in Skinnerbots
- 1 January 1997
- journal article
- Published by SAGE Publications in Adaptive Behavior
- Vol. 5 (3-4) , 219-247
- https://doi.org/10.1177/105971239700500302
Abstract
Instrumental (or operant) conditioning, a form of animal learning, is similar to reinforcement learning (Watkins, 1989) in that it allows an agent to adapt its actions to gain maximally from the environment while being rewarded only for correct performance. However, animals learn much more complicated behaviors through instrumental conditioning than robots presently acquire through reinforcement learning. We describe a new computational model of the conditioning process that attempts to capture some of the aspects that are missing from simple reinforcement learning: conditioned reinforcers, shifting reinforcement contingencies, explicit action sequencing, and state space refinement. We apply our model to a task commonly used to study working memory in rats and monkeys—the delayed match-to-sample task. Animals learn this task in stages. In simulation, our model also acquires the task in stages, in a similar manner. We have used the model to train an RWI B21 robot.Keywords
This publication has 27 references indexed in Scilit:
- Rapid, safe, and incremental learning of navigation strategiesIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1996
- Behavior analysis and training-a methodology for behavior engineeringIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1996
- Multilevel analysis of classical conditioning in a behaving real world artifactRobotics and Autonomous Systems, 1995
- Bee foraging in uncertain environments using predictive hebbian learningNature, 1995
- Designing and Understanding Adaptive Group BehaviorAdaptive Behavior, 1995
- Robot shaping: developing autonomous agents through learningArtificial Intelligence, 1994
- Sniffy, the virtual rat: Simulated operant conditioningBehavior Research Methods, Instruments & Computers, 1994
- Structured control for autonomous robotsIEEE Transactions on Robotics and Automation, 1994
- Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat.Behavioral Neuroscience, 1993
- Natural syntax rules control action sequence of ratsBehavioural Brain Research, 1987