Best-Case Retrieval Evaluation: Improving the Sensitivity of Reciprocal Rank with Lexicographic Precision

06/13/2023
by Fernando Diaz, et al.

Across a variety of ranking tasks, researchers use reciprocal rank to measure the effectiveness for users interested in exactly one relevant item. Despite its widespread use, evidence suggests that reciprocal rank is brittle when discriminating between systems. This brittleness, in turn, is compounded in modern evaluation settings where current, high-precision systems may be difficult to distinguish. We address the lack of sensitivity of reciprocal rank by introducing and connecting it to the concept of best-case retrieval, an evaluation method focusing on assessing the quality of a ranking for the most satisfied possible user across possible recall requirements. This perspective allows us to generalize reciprocal rank and define a new preference-based evaluation we call lexicographic precision or lexiprecision. By mathematical construction, we ensure that lexiprecision preserves differences detected by reciprocal rank, while empirically improving sensitivity and robustness across a broad set of retrieval and recommendation tasks.
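
As a rough illustration of the idea in the abstract, the sketch below computes reciprocal rank and then a lexicographic preference between two rankings. The abstract does not give the formal definition, so this assumes that lexiprecision compares the rank positions of the first relevant item, then the second, and so on, preferring the ranking whose relevant item appears earlier at the first point of difference. The function names and the binary relevance-list input are hypothetical choices made for this example, not the authors' implementation.

```python
from typing import List, Sequence


def relevant_positions(relevance: Sequence[int]) -> List[int]:
    """Return the 1-based ranks at which relevant items appear."""
    return [rank for rank, rel in enumerate(relevance, start=1) if rel > 0]


def reciprocal_rank(relevance: Sequence[int]) -> float:
    """Reciprocal of the rank of the first relevant item (0.0 if none)."""
    positions = relevant_positions(relevance)
    return 1.0 / positions[0] if positions else 0.0


def lexi_preference(a: Sequence[int], b: Sequence[int]) -> int:
    """Return +1 if ranking a is preferred, -1 if b is preferred, 0 if tied.

    Assumed rule: compare the positions of the i-th relevant item for
    i = 1, 2, ... and decide at the first position that differs.
    """
    pos_a, pos_b = relevant_positions(a), relevant_positions(b)
    for pa, pb in zip(pos_a, pos_b):
        if pa != pb:
            return 1 if pa < pb else -1
    # Assumption: if all shared positions tie, the ranking that retrieved
    # more relevant items is preferred (missing items count as ranked last).
    if len(pos_a) != len(pos_b):
        return 1 if len(pos_a) > len(pos_b) else -1
    return 0


# Two rankings with the same first relevant position: reciprocal rank ties,
# while the lexicographic comparison separates them on the second item.
ranking_a = [0, 1, 0, 1, 0]
ranking_b = [0, 1, 0, 0, 1]
tie_in_rr = reciprocal_rank(ranking_a) == reciprocal_rank(ranking_b)
preference = lexi_preference(ranking_a, ranking_b)
```

Under this assumed rule, any difference in reciprocal rank is decided at the first relevant item, so the preference never contradicts reciprocal rank; it only breaks ties that reciprocal rank leaves unresolved.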

research
02/22/2023

Recall as a Measure of Ranking Robustness

Researchers use recall to evaluate rankings across a variety of retrieva...
research
03/02/2018

RankDCG: Rank-Ordering Evaluation Measure

Ranking is used for a wide array of problems, most notably information r...
research
04/25/2022

Offline Retrieval Evaluation Without Evaluation Metrics

Offline evaluation of information retrieval and recommendation has tradi...
research
05/07/2018

An Axiomatic Analysis of Diversity Evaluation Metrics: Introducing the Rank-Biased Utility Metric

Many evaluation metrics have been defined to evaluate the effectiveness ...
research
08/16/2016

Scalable Learning of Non-Decomposable Objectives

Modern retrieval systems are often driven by an underlying machine learn...
research
01/21/2021

Assessing the Benefits of Model Ensembles in Neural Re-Ranking for Passage Retrieval

Our work aimed at experimentally assessing the benefits of model ensembl...
research
05/27/2019

Minimizing Time-to-Rank: A Learning and Recommendation Approach

Consider the following problem faced by an online voting platform: A use...
