Conditional Universal Consistency

Preprint

preprint

Published in RePEc

Abstract

Players choose an action before learning an outcome chosen according to an unknown and history-dependent stochastic rule. Procedures that categorize outcomes, and use a randomized variation on fictitious play within each category are studied. These procedures are â€œconditionally consistent:â€ they yield almost as high a time-average payoff as if the player knew the conditional distributions of actions given categories. Moreover, given any alternative procedure, there is a conditionally consistent procedure whose performance is no more than epsilon worse regardless of the discount factor. We also discuss cycles, and argue that the time-average of play should resemble a correlated equilibrium.

Keywords

CONDITIONALLY CONSISTENT
CONDITIONAL UNIVERSAL
UNIVERSAL CONSISTENCY
LEARNING AN OUTCOME
CONSISTENCY PLAYERS
PLAYERS CHOOSE
OUTCOME CHOSEN
STOCHASTIC
LEARNING

All Related Versions

Version 1, RePEc
Version 1, RePEc (Unconfirmed version)

This publication has 0 references indexed in Scilit: