Confidence Intervals for Error Rates in Matching Tasks: Critical Review and Recommendations

06/01/2023
by   Riccardo Fogliato, et al.
0

Matching algorithms are commonly used to predict matches between items in a collection. For example, in 1:1 face verification, a matching algorithm predicts whether two face images depict the same person. Accurately assessing the uncertainty of the error rates of such algorithms can be challenging when data are dependent and error rates are low, two aspects that have been often overlooked in the literature. In this work, we review methods for constructing confidence intervals for error rates in matching tasks such as 1:1 face verification. We derive and examine the statistical properties of these methods and demonstrate how coverage and interval width vary with sample size, error rates, and degree of data dependence using both synthetic and real-world datasets. Based on our findings, we provide recommendations for best practices for constructing confidence intervals for error rates in matching tasks.

READ FULL TEXT
research
01/26/2022

Confidence Intervals for the Generalisation Error of Random Forests

Out-of-bag error is commonly used as an estimate of generalisation error...
research
02/07/2022

Valid confidence intervals for μ, σ when there is only one observation available

Portnoy (2019) considered the problem of constructing an optimal confide...
research
04/16/2018

Confidence intervals for the area under the receiver operating characteristic curve in the presence of ignorable missing data

Receiver operating characteristic (ROC) curves are widely used as a meas...
research
04/10/2019

Practical Valid Inferences for the Two-Sample Binomial Problem

Consider comparing two independent binomial responses. Our interest is w...
research
12/11/2021

Confidence intervals for the random forest generalization error

We show that underneath the training process of a random forest there li...
research
08/10/2020

Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees

Aggregating data is fundamental to data analytics, data exploration, and...
research
07/04/2022

Cost-Efficient Fixed-Width Confidence Intervals for the Difference of Two Bernoulli Proportions

We study properties of confidence intervals (CIs) for the difference of ...

Please sign up or login with your details

Forgot password? Click here to reset