Identifying Critical Neurons in ANN Architectures using Mixed Integer Programming

02/17/2020
by   Mostafa Elaraby, et al.
103

We introduce a novel approach to optimize the architecture of deep neural networks by identifying critical neurons and removing non-critical ones. The proposed approach utilizes a mixed integer programming (MIP) formulation of neural models which includes a continuous importance score computed for each neuron in the network. The optimization in MIP solver minimizes the number of critical neurons (i.e., with high importance score) that need to be kept for maintaining the overall accuracy of the model. Further, the proposed formulation generalizes the recently considered lottery ticket optimization by identifying multiple "lucky" sub-networks resulting in optimized architecture that not only perform well on a single dataset, but also generalize across multiple ones upon retraining of network weights. Finally, the proposed framework provides significant improvement in scalability of automatic sparsification of deep network architectures compared to previous attempts. We validate the performance and generalizability of our approach on MNIST, Fashion-MNIST, and CIFAR-10 datasets, using three different neural networks: LeNet 5 and two ReLU fully connected models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2021

Efficient and Robust Mixed-Integer Optimization Methods for Training Binarized Deep Neural Networks

Compared to classical deep neural networks its binarized versions can be...
research
08/19/2020

ReLU activated Multi-Layer Neural Networks trained with Mixed Integer Linear Programs

This paper is a case study to demonstrate that, in principle, multi-laye...
research
03/11/2022

A Mixed Integer Programming Approach for Verifying Properties of Binarized Neural Networks

Many approaches for verifying input-output properties of neural networks...
research
01/03/2022

A Mixed Integer Programming Approach to Training Dense Neural Networks

Artificial Neural Networks (ANNs) are prevalent machine learning models ...
research
08/29/2018

Unified Receiver Design in Wireless Relay Networks Using Mixed-Integer Programming Techniques

Wireless receiver design is critical to the overall system performance. ...
research
11/17/2015

Learning Neural Network Architectures using Backpropagation

Deep neural networks with millions of parameters are at the heart of man...
research
06/17/2018

How Could Polyhedral Theory Harness Deep Learning?

The holy grail of deep learning is to come up with an automatic method t...

Please sign up or login with your details

Forgot password? Click here to reset