Efficient Failure Pattern Identification of Predictive Algorithms

06/01/2023
by   Bao Nguyen, et al.
0

Given a (machine learning) classifier and a collection of unlabeled data, how can we efficiently identify misclassification patterns presented in this dataset? To address this problem, we propose a human-machine collaborative framework that consists of a team of human annotators and a sequential recommendation algorithm. The recommendation algorithm is conceptualized as a stochastic sampler that, in each round, queries the annotators a subset of samples for their true labels and obtains the feedback information on whether the samples are misclassified. The sampling mechanism needs to balance between discovering new patterns of misclassification (exploration) and confirming the potential patterns of classification (exploitation). We construct a determinantal point process, whose intensity balances the exploration-exploitation trade-off through the weighted update of the posterior at each round to form the generator of the stochastic sampler. The numerical results empirically demonstrate the competitive performance of our framework on multiple datasets at various signal-to-noise ratios.

READ FULL TEXT

page 12

page 14

research
11/16/2020

CoSam: An Efficient Collaborative Adaptive Sampler for Recommendation

Sampling strategies have been widely applied in many recommendation syst...
research
09/16/2022

Thompson Sampling with Virtual Helping Agents

We address the problem of online sequential decision making, i.e., balan...
research
02/07/2017

Learning what matters - Sampling interesting patterns

In the field of exploratory data mining, local structure in data can be ...
research
09/26/2021

Deep Exploration for Recommendation Systems

We investigate the design of recommendation systems that can efficiently...
research
04/02/2021

Blind Exploration and Exploitation of Stochastic Experts

We present blind exploration and exploitation (BEE) algorithms for ident...
research
05/30/2022

Adaptive Learning for Discovery

In this paper, we study a sequential decision-making problem, called Ada...
research
04/01/2018

SampleAhead: Online Classifier-Sampler Communication for Learning from Synthesized Data

State-of-the-art techniques of artificial intelligence, in particular de...

Please sign up or login with your details

Forgot password? Click here to reset