Humans Can Adopt Optimal Discounting Strategy under Real-Time Constraints

Open Access

1 January 2006

journal article
research article
Published by Public Library of Science (PLoS) in PLoS Computational Biology

Vol. 2 (11) , e152
https://doi.org/10.1371/journal.pcbi.0020152

Abstract

Critical to our many daily choices between larger delayed rewards, and smaller more immediate rewards, are the shape and the steepness of the function that discounts rewards with time. Although research in artificial intelligence favors exponential discounting in uncertain environments, studies with humans and animals have consistently shown hyperbolic discounting. We investigated how humans perform in a reward decision task with temporal constraints, in which each choice affects the time remaining for later trials, and in which the delays vary at each trial. We demonstrated that most of our subjects adopted exponential discounting in this experiment. Further, we confirmed analytically that exponential discounting, with a decay rate comparable to that used by our subjects, maximized the total reward gain in our task. Our results suggest that the particular shape and steepness of temporal discounting is determined by the task that the subject is facing, and question the notion of hyperbolic reward discounting as a universal principle. When we make a choice between two options, we compare the values of their outcomes and select the option with a larger value. However, what if one option leads to a larger delayed reward and the other leads to a smaller more immediate reward? Naturally, we assign a larger value for a larger reward, but it is “discounted” if the reward is to be delivered later. Thus, the value is a monotonically decreasing function of the delays. Previous behavioral studies have repeatedly demonstrated that humans and animals discount delayed rewards hyperbolically. This has practical importance, as hyperbolic discounting can sometimes lead to “irrational” preference reversal: for instance, an individual may prefer two apples in 51 days to one apple in 50 days, but if the days come closer, he prefers one apple today to two apples tomorrow. On the contrary, exponential discounting is always “rational,” as it predicts constant preference. Here, in a new task that mimics animal foraging, and that uses delayed monetary rewards, Schweighofer and colleagues showed that humans can also discount reward exponentially. Furthermore, it is remarkable that by adopting exponential discounting, their subjects maximized their total gain. Thus, depending on the task at hand, the authors' study suggests that humans can flexibly choose the type of reward discounting, and can exhibit rational behavior that maximizes long-term gains.

Keywords

This publication has 37 references indexed in Scilit:

Normative and Descriptive Models of Decision Making: Time Discounting and Risk Sensitivity
Published by Wiley ,2007
Précis of Breakdown of Will
Behavioral and Brain Sciences, 2005
Measuring state changes in human delay discounting: an experiential discounting task
Behavioural Processes, 2004
Impulsivity and rapid discounting of delayed hypothetical rewards in cocaine-dependent individuals.
Experimental and Clinical Psychopharmacology, 2003
Metalearning and neuromodulation
Neural Networks, 2002
Reward discounting as a measure of impulsive behavior in a psychiatric outpatient population.
Experimental and Clinical Psychopharmacology, 2000
Out of Control: Visceral Influences on Behavior
Organizational Behavior and Human Decision Processes, 1996
Modeling Myopic Decisions: Evidence for Hyperbolic Delay-Discounting within Subjects and Amounts
Organizational Behavior and Human Decision Processes, 1995
Anomalies in Intertemporal Choice: Evidence and an Interpretation
The Quarterly Journal of Economics, 1992
Specious reward: A behavioral theory of impulsiveness and impulse control.
Psychological Bulletin, 1975