Online learning in repeated auctions

11/18/2015
by   Jonathan Weed, et al.
0

Motivated by online advertising auctions, we consider repeated Vickrey auctions where goods of unknown value are sold sequentially and bidders only learn (potentially noisy) information about a good's value once it is purchased. We adopt an online learning approach with bandit feedback to model this problem and derive bidding strategies for two models: stochastic and adversarial. In the stochastic model, the observed values of the goods are random variables centered around the true value of the good. In this case, logarithmic regret is achievable when competing against well behaved adversaries. In the adversarial model, the goods need not be identical and we simply compare our performance against that of the best fixed bid in hindsight. We show that sublinear regret is also achievable in this case and prove matching minimax lower bounds. To our knowledge, this is the first complete set of strategies for bidders participating in auctions of this type.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2018

Adversarial Online Learning with noise

We present and study models of adversarial online learning where the fee...
research
03/09/2023

On the Value of Stochastic Side Information in Online Learning

We study the effectiveness of stochastic side information in determinist...
research
06/23/2021

Best-Case Lower Bounds in Online Learning

Much of the work in online learning focuses on the study of sublinear up...
research
11/03/2017

Learning to Bid Without Knowing your Value

We address online learning in complex auction settings, such as sponsore...
research
04/04/2023

Online Learning with Adversaries: A Differential Inclusion Analysis

We consider the measurement model Y = AX, where X and, hence, Y are rand...
research
06/03/2021

Bandit Phase Retrieval

We study a bandit version of phase retrieval where the learner chooses a...
research
04/27/2011

Online Learning: Stochastic and Constrained Adversaries

Learning theory has largely focused on two main learning scenarios. The ...

Please sign up or login with your details

Forgot password? Click here to reset