Raising the Bar on the Evaluation of Out-of-Distribution Detection

by Jishnu Mukhoti, et al.

In image classification, considerable progress has been made in detecting out-of-distribution (OoD) data. However, most OoD detection methods are evaluated on a standard set of datasets that are arbitrarily different from the training data. There is no clear definition of what forms a "good" OoD dataset. Furthermore, state-of-the-art OoD detection methods already achieve near-perfect results on these standard benchmarks. In this paper, we define two categories of OoD data using the subtly different concepts of perceptual/visual and semantic similarity to in-distribution (iD) data. We define Near OoD samples as perceptually similar to but semantically different from iD samples, and Shifted samples as points which are visually different from but semantically akin to iD data. We then propose a GAN-based framework for generating OoD samples from each of these two categories, given an iD dataset. Through extensive experiments on MNIST, CIFAR-10/100 and ImageNet, we show that (a) state-of-the-art OoD detection methods which perform exceedingly well on conventional benchmarks are significantly less robust on our proposed benchmark, and (b) models performing well on our setup also perform well on conventional real-world OoD detection benchmarks and vice versa, thereby indicating that one might not even need a separate OoD set to reliably evaluate performance in OoD detection.
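To make the two definitions concrete, the following is a minimal sketch of the taxonomy as a lookup on two similarity judgments. It is an illustration only, not the paper's method: the names `SampleKind` and `categorize` are hypothetical, the boolean inputs assume that perceptual and semantic similarity have already been decided by some other means, and the labels for the two cases the abstract does not name (similar on both axes, different on both axes) are assumptions.

```python
from enum import Enum


class SampleKind(Enum):
    """Hypothetical labels for a sample relative to an iD dataset."""
    ID_LIKE = "iD-like"                    # assumption: similar on both axes
    NEAR_OOD = "Near OoD"                  # defined in the abstract
    SHIFTED = "Shifted"                    # defined in the abstract
    CONVENTIONAL_OOD = "conventional OoD"  # assumption: different on both axes


def categorize(perceptually_similar: bool, semantically_similar: bool) -> SampleKind:
    """Map two similarity judgments (relative to iD data) to a category.

    Near OoD: perceptually similar but semantically different.
    Shifted:  perceptually different but semantically similar.
    """
    if perceptually_similar and not semantically_similar:
        return SampleKind.NEAR_OOD
    if not perceptually_similar and semantically_similar:
        return SampleKind.SHIFTED
    if perceptually_similar and semantically_similar:
        return SampleKind.ID_LIKE  # assumed label, not from the abstract
    return SampleKind.CONVENTIONAL_OOD  # assumed label, not from the abstract


if __name__ == "__main__":
    # Enumerate all four combinations of the two similarity axes.
    for perceptual in (True, False):
        for semantic in (True, False):
            kind = categorize(perceptual, semantic)
            print(f"perceptual={perceptual}, semantic={semantic}: {kind.value}")
```

In this framing, the conventional benchmarks criticized above would mostly fall in the last case, whereas the proposed benchmark targets the two mixed cases.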




