Interpretable Explanations of Black Boxes by Meaningful Perturbation

04/11/2017
by   Ruth Fong, et al.
0

As machine learning algorithms are increasingly applied to high impact yet high risk tasks, e.g. problems in health, it is critical that researchers can explain how such algorithms arrived at their predictions. In recent years, a number of image saliency methods have been developed to summarize where highly complex neural networks "look" in an image for evidence for their predictions. However, these techniques are limited by their heuristic nature and architectural constraints. In this paper, we make two main contributions: First, we propose a general framework for learning different kinds of explanations for any black box algorithm. Second, we introduce a paradigm that learns the minimally salient part of an image by directly editing it and learning from the corresponding changes to its output. Unlike previous works, our method is model-agnostic and testable because it is grounded in replicable image perturbations.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

page 8

research
11/22/2016

Programs as Black-Box Explanations

Recent work in model-agnostic explanations of black-box machine learning...
research
09/12/2020

MeLIME: Meaningful Local Explanation for Machine Learning Models

Most state-of-the-art machine learning algorithms induce black-box model...
research
09/04/2020

Towards Musically Meaningful Explanations Using Source Separation

Deep neural networks (DNNs) are successfully applied in a wide variety o...
research
04/05/2018

Explanations of model predictions with live and breakDown packages

Complex models are commonly used in predictive modeling. In this paper w...
research
07/07/2021

Recurrence-Aware Long-Term Cognitive Network for Explainable Pattern Classification

Machine learning solutions for pattern classification problems are nowad...
research
11/30/2018

An Interpretable Model with Globally Consistent Explanations for Credit Risk

We propose a possible solution to a public challenge posed by the Fair I...
research
02/07/2021

Bandits for Learning to Explain from Explanations

We introduce Explearn, an online algorithm that learns to jointly output...

Please sign up or login with your details

Forgot password? Click here to reset