• preprint
    • Published in RePEc
Abstract
We consider the model of social learning by Schlag (1996). Individuals must repeatedly choose an action in a multi-armed bandit. We assume that each indivdiual observes the outcomes of two other individuals' choices before her own next choice must be made -- the original model only allows for one observation. Selection of optimal behavior yields a variant of the proportional imitation rule -- the optimal rule based on one observation. When each individual uses this rule then the adaptation of actions in an infinite population follows an aggregate monotone dynamic.
All Related Versions

This publication has 0 references indexed in Scilit: