Estimating the Accuracies of Multiple Classifiers Without Labeled Data

07/29/2014
by   Ariel Jaffe, et al.
0

In various situations one is given only the predictions of multiple classifiers over a large unlabeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the reliability of these different classifiers, is it possible to consistently and computationally efficiently estimate their accuracies? Furthermore, also in a completely unsupervised manner, can one construct a more accurate unsupervised ensemble classifier? In this paper, focusing on the binary case, we present simple, computationally efficient algorithms to solve these questions. Furthermore, under standard classifier independence assumptions, we prove our methods are consistent and study their asymptotic error. Our approach is spectral, based on the fact that the off-diagonal entries of the classifiers' covariance matrix and 3-d tensor are rank-one. We illustrate the competitive performance of our algorithms via extensive experiments on both artificial and real datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/13/2013

Ranking and combining multiple predictors without labeled data

In a broad range of classification and decision making problems, one is ...
research
10/20/2015

Unsupervised Ensemble Learning with Dependent Classifiers

In unsupervised ensemble learning, one obtains predictions from multiple...
research
02/13/2018

Unsupervised Evaluation and Weighted Aggregation of Ranked Predictions

Learning algorithms that aggregate predictions from an ensemble of diver...
research
07/09/2021

Specialists Outperform Generalists in Ensemble Classification

Consider an ensemble of k individual classifiers whose accuracies are kn...
research
03/25/2016

Object Recognition Based on Amounts of Unlabeled Data

This paper proposes a novel semi-supervised method on object recognition...
research
03/08/2017

Unsupervised Ensemble Regression

Consider a regression problem where there is no labeled data and the onl...
research
03/07/2018

Sequential Maximum Margin Classifiers for Partially Labeled Data

In many real-world applications, data is not collected as one batch, but...

Please sign up or login with your details

Forgot password? Click here to reset