Towards Explaining Adversarial Examples Phenomenon in Artificial Neural Networks

07/22/2021
by   Ramin Barati, et al.
0

In this paper, we study the adversarial examples existence and adversarial training from the standpoint of convergence and provide evidence that pointwise convergence in ANNs can explain these observations. The main contribution of our proposal is that it relates the objective of the evasion attacks and adversarial training with concepts already defined in learning theory. Also, we extend and unify some of the other proposals in the literature and provide alternative explanations on the observations made in those proposals. Through different experiments, we demonstrate that the framework is valuable in the study of the phenomenon and is applicable to real-world problems.

READ FULL TEXT
research
05/26/2022

An Analytic Framework for Robust Training of Artificial Neural Networks

The reliability of a learning model is key to the successful deployment ...
research
12/20/2014

Explaining and Harnessing Adversarial Examples

Several machine learning models, including neural networks, consistently...
research
03/25/2019

Robust Neural Networks using Randomized Adversarial Training

Since the discovery of adversarial examples in machine learning, researc...
research
01/26/2016

Unifying Adversarial Training Algorithms with Flexible Deep Data Gradient Regularization

Many previous proposals for adversarial training of deep neural nets hav...
research
09/21/2022

Toy Models of Superposition

Neural networks often pack many unrelated concepts into a single neuron ...
research
02/22/2018

Adversarial Training for Probabilistic Spiking Neural Networks

Classifiers trained using conventional empirical risk minimization or ma...

Please sign up or login with your details

Forgot password? Click here to reset