Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits

06/30/2020
by   Xinming Liu, et al.
0

While traditional economics assumes that humans are fully rational agents who always maximize their expected utility, in practice, we constantly observe apparently irrational behavior. One explanation is that people have limited computational power, so that they are, quite rationally, making the best decisions they can, given their computational limitations. To test this hypothesis, we consider the multi-armed bandit (MAB) problem. We examine a simple strategy for playing an MAB that can be implemented easily by a probabilistic finite automaton (PFA). Roughly speaking, the PFA sets certain expectations, and plays an arm as long as it meets them. If the PFA has sufficiently many states, it performs near-optimally. Its performance degrades gracefully as the number of states decreases. Moreover, the PFA acts in a "human-like" way, exhibiting a number of standard human biases, like an optimism bias and a negativity bias.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2021

Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Assistive multi-armed bandit problems can be used to model team situatio...
research
05/03/2018

An Asymptotically Optimal Strategy for Constrained Multi-armed Bandit Problems

For the stochastic multi-armed bandit (MAB) problem from a constrained m...
research
07/25/2023

Strategic Play By Resource-Bounded Agents in Security Games

Many studies have shown that humans are "predictably irrational": they d...
research
08/17/2013

Decision Theory with Resource-Bounded Agents

There have been two major lines of research aimed at capturing resource-...
research
01/05/2020

A Hoeffding Inequality for Finite State Markov Chains and its Applications to Markovian Bandits

This paper develops a Hoeffding inequality for the partial sums ∑_k=1^n ...
research
11/05/2019

Response Prediction for Low-Regret Agents

Companies like Google and Microsoft run billions of auctions every day t...
research
04/18/2023

Playing it safe: information constrains collective betting strategies

Every interaction of a living organism with its environment involves the...

Please sign up or login with your details

Forgot password? Click here to reset