Nonparametric learning rules from bandit experiments: The eyes have it!

B-Tier
Journal: Games and Economic Behavior
Year: 2013
Volume: 81
Issue: C
Pages: 215-231

Score contribution per author:

0.670 = (α=2.01 / 3 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

How do people learn? We assess, in a model-free manner, subjectsʼ belief dynamics in a two-armed bandit learning experiment. A novel feature of our approach is to supplement the choice and reward data with subjectsʼ eye movements during the experiment to pin down estimates of subjectsʼ beliefs. Estimates show that subjects are more reluctant to “update down” following unsuccessful choices, than “update up” following successful choices. The profits from following the estimated learning and decision rules are smaller (by about 25% of average earnings by subjects in this experiment) than what would be obtained from a fully-rational Bayesian learning model, but comparable to the profits from alternative non-Bayesian learning models, including reinforcement learning and a simple “win-stay” choice heuristic.

Technical Details

RePEc Handle
repec:eee:gamebe:v:81:y:2013:i:c:p:215-231
Journal Field
Theory
Author Count
3
Added to Database
2026-01-29