TopRank: A practical algorithm for online stochastic ranking

06/06/2018
by   Tor Lattimore, et al.
2

Online learning to rank is a sequential decision-making problem where in each round the learning agent chooses a list of items and receives feedback in the form of clicks from the user. Many sample-efficient algorithms have been proposed for this problem that assume a specific click model connecting rankings and user behavior. We propose a generalized click model that encompasses many existing models, including the position-based and cascade models. Our generalization motivates a novel online learning algorithm based on topological sort, which we call TopRank. TopRank is (a) more natural than existing algorithms, (b) has stronger regret guarantees than existing algorithms with comparable generality, (c) has a more insightful proof that leaves the door open to many generalizations, (d) outperforms existing algorithms empirically.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2017

Online Learning to Rank in Stochastic Click Models

Online learning to rank is a core problem in information retrieval and m...
research
11/01/2018

Online Diverse Learning to Rank from Partial-Click Feedback

Learning to rank is an important problem in machine learning and recomme...
research
05/26/2023

Adversarial Attacks on Online Learning to Rank with Click Feedback

Online learning to rank (OLTR) is a sequential decision-making problem w...
research
05/18/2012

Online Structured Prediction via Coactive Learning

We propose Coactive Learning as a model of interaction between a learnin...
research
11/05/2020

Efficient Online Learning of Optimal Rankings: Dimensionality Reduction via Gradient Descent

We consider a natural model of online preference aggregation, where sets...
research
02/04/2019

Online Multiclass Classification Based on Prediction Margin for Partial Feedback

We consider the problem of online multiclass classification with partial...
research
11/03/2011

Online Learning with Preference Feedback

We propose a new online learning model for learning with preference feed...

Please sign up or login with your details

Forgot password? Click here to reset