Quantifying the Uncertainty of Precision Estimates for Rule based Text Classifiers

05/19/2020
by   James Nutaro, et al.
0

Rule based classifiers that use the presence and absence of key sub-strings to make classification decisions have a natural mechanism for quantifying the uncertainty of their precision. For a binary classifier, the key insight is to treat partitions of the sub-string set induced by the documents as Bernoulli random variables. The mean value of each random variable is an estimate of the classifier's precision when presented with a document inducing that partition. These means can be compared, using standard statistical tests, to a desired or expected classifier precision. A set of binary classifiers can be combined into a single, multi-label classifier by an application of the Dempster-Shafer theory of evidence. The utility of this approach is demonstrated with a benchmark problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2011

Fuzzy Rules and Evidence Theory for Satellite Image Analysis

Design of a fuzzy rule based classifier is proposed. The performance of ...
research
11/30/2018

Learning Interpretable Rules for Multi-label Classification

Multi-label classification (MLC) is a supervised learning problem in whi...
research
06/19/2020

Classifier uncertainty: evidence, potential impact, and probabilistic treatment

Classifiers are often tested on relatively small data sets, which should...
research
07/16/2020

Conformal Rule-Based Multi-label Classification

We advocate the use of conformal prediction (CP) to enhance rule-based m...
research
06/30/2011

A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence

This paper presents a new classifier combination technique based on the ...
research
06/12/2015

Knowledge Representation in Learning Classifier Systems: A Review

Knowledge representation is a key component to the success of all rule b...
research
10/04/2017

Constructing multi-modality and multi-classifier radiomics predictive models through reliable classifier fusion

Radiomics aims to extract and analyze large numbers of quantitative feat...

Please sign up or login with your details

Forgot password? Click here to reset