Learning Signaling Behaviors and Specialization in Cooperative Agents

Abstract
In this article, we present a learning mechanism that allows a multiagent system to cooperate to achieve a gathering task efficiently in unknown and changing environments. The multiagent system is a team of autonomous behavior-based agents with limited communication capabilities. Cooperation is based on the acquisition of signaling behaviors and on the specialization of the agents into different types. Every agent has the same collection of built-in reactive behaviors. Some of the built-in behaviors are fixed, whereas others can be modified through reinforcement learning. The reinforcement signal is delayed until a trial is completed and assesses the collective performance of the team. Each agent uses this common signal to learn what individual behaviors are more suitable for the team. Simulation results, and the corresponding statistical analysis, show that the multiagent system always achieves near-optimal performances.