Robust Explainability: A Tutorial on Gradient-Based Attribution Methods for Deep Neural Networks

07/23/2021
by Ian E. Nielsen, et al.

With the rise of deep neural networks, the challenge of explaining their predictions has become increasingly recognized. While many methods for explaining the decisions of deep neural networks exist, there is currently no consensus on how to evaluate them. Robustness, meanwhile, is a popular topic in deep learning research, yet until very recently it was hardly discussed in the context of explainability. In this tutorial paper, we start by presenting gradient-based interpretability methods. These techniques use gradient signals to assign the burden of the decision to the input features. We then discuss how gradient-based methods can be evaluated for their robustness and the role that adversarial robustness plays in producing meaningful explanations. We also discuss the limitations of gradient-based methods. Finally, we present best practices and the attributes that should be examined before choosing an explainability method. We conclude with future directions for research at the convergence of robustness and explainability.
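As a rough illustration of what "using gradient signals to assign the decision to input features" can mean, the sketch below computes a plain gradient (saliency) attribution and a gradient-times-input attribution for a toy logistic unit. The model, its weights, and the function names are hypothetical and not taken from the paper. The gradient is approximated with central finite differences so the example needs only the standard library; in practice a deep learning framework would obtain it by backpropagation.

```python
import math

# Hypothetical toy model: a single logistic unit with made-up weights.
WEIGHTS = (2.0, -1.0, 0.5)
BIAS = 0.1

def model(x):
    """Return the scalar output of the toy logistic unit for input x."""
    z = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return 1.0 / (1.0 + math.exp(-z))

def gradient_attribution(f, x, eps=1e-5):
    """Approximate the partial derivative of f with respect to each
    input feature using central finite differences."""
    grads = []
    for i in range(len(x)):
        x_plus = list(x)
        x_minus = list(x)
        x_plus[i] += eps
        x_minus[i] -= eps
        grads.append((f(x_plus) - f(x_minus)) / (2 * eps))
    return grads

x = [1.0, 2.0, -1.0]

# Plain gradient ("saliency"): sensitivity of the output to each feature.
saliency = gradient_attribution(model, x)

# Gradient-times-input: scales each sensitivity by the feature value.
grad_times_input = [g * xi for g, xi in zip(saliency, x)]

print("saliency:", saliency)
print("gradient * input:", grad_times_input)
```

For this toy model, the sign of each saliency value follows the sign of the corresponding weight, which gives a simple sanity check on the attribution.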


