Choosing a good toolkit, II: Bayes-rule based heuristics

B-Tier
Journal: Journal of Economic Dynamics and Control
Year: 2020
Volume: 111
Issue: C

Authors (2)

Francetich, Alejandro (not in RePEc) Kreps, David (Stanford University)

Score contribution per author:

1.005 = (α=2.01 / 2 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

We study heuristics for a class of complex multi-armed bandit problems, the period-by-period choice of a set of objects or “toolkit” where the decision maker learns about the value of tools within the chosen toolkit. This paper studies heuristics that involve a decision maker who employs Bayesian inference. Analytical results are combined with simulations to gain insights into the relative performance of these heuristics. We depart from the extensive bandit-learning literature in computer science and operations research by employing the discounted-expected-reward formulation that stresses the importance of the classic exploration–exploitation tradeoff. A companion paper, Francetich and Kreps   (2019), studies a variety of prior-free heuristics.

Technical Details

RePEc Handle
repec:eee:dyncon:v:111:y:2020:i:c:s0165188918302689
Journal Field
Macro
Author Count
2
Added to Database
2026-01-25