Classification Under Partial Reject Options
We study set-valued classification for a Bayesian model where data originates from one of a finite number N of possible hypotheses. Thus we consider the scenario where the size of the classified set of categories ranges from 0 to N. Empty sets corresponds to an outlier, size 1 represents a firm decision that singles out one hypotheses, size N corresponds to a rejection to classify, whereas sizes 2…,N-1 represent a partial rejection, where some hypotheses are excluded from further analysis. We introduce a general framework of reward functions with a set-valued argument and derive the corresponding optimal Bayes classifiers, for a homogeneous block of hypotheses and for when hypotheses are partitioned into blocks, where ambiguity within and between blocks are of different severity. We illustrate classification using an ornithological dataset, with taxa partitioned into blocks and parameters estimated using MCMC. The associated reward function's tuning parameters are chosen through cross-validation.
READ FULL TEXT