A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks

07/10/2018
by   Kimin Lee, et al.
3

Detecting test samples drawn sufficiently far away from the training distribution statistically or adversarially is a fundamental requirement to deploying a good classifier in many real-world machine learning applications. However, deep neural networks with the softmax classifier are known to produce highly overconfident posterior distributions even for such abnormal samples. In this paper, we propose a simple yet effective method for detecting any abnormal samples, which is applicable to any pre-trained softmax neural classifier. We obtain the class conditionalGaussian distributions with respect to (low- and upper-level) features of the deep models under Gaussian discriminant analysis, which result in a confidence score based on the Mahalanobis distance. While most prior methods have been evaluated for detecting either out-of-distribution or adversarial samples, but not both, the proposed method achieves the state-of-art performances for both cases in our experiments. Moreover, we found that our proposed method is more robust in extreme cases, e.g., when the training dataset has noisy labels or small number of samples. Finally, we show that the proposed method enjoys broader usage by applying it to class incremental learning: whenever out-of-distribution samples are detected, our classification rule can incorporate new classes well without further training deep models.

READ FULL TEXT

page 14

page 15

page 16

page 17

research
07/10/2021

Out of Distribution Detection and Adversarial Attacks on Deep Neural Networks for Robust Medical Image Analysis

Deep learning models have become a popular choice for medical image anal...
research
04/02/2021

Multi-Class Data Description for Out-of-distribution Detection

The capability of reliably detecting out-of-distribution samples is one ...
research
01/29/2023

Learning to reject meets OOD detection: Are all abstentions created equal?

Learning to reject (L2R) and out-of-distribution (OOD) detection are two...
research
02/19/2020

Variational Encoder-based Reliable Classification

Machine learning models provide statistically impressive results which m...
research
02/15/2020

Extreme Classification via Adversarial Softmax Approximation

Training a classifier over a large number of classes, known as 'extreme ...
research
04/23/2021

Lightweight Detection of Out-of-Distribution and Adversarial Samples via Channel Mean Discrepancy

Detecting out-of-distribution (OOD) and adversarial samples is essential...
research
02/16/2021

Unsupervised Energy-based Out-of-distribution Detection using Stiefel-Restricted Kernel Machine

Detecting out-of-distribution (OOD) samples is an essential requirement ...

Please sign up or login with your details

Forgot password? Click here to reset