NICE: Noise Injection and Clamping Estimation for Neural Network Quantization

09/29/2018
by Chaim Baskin, et al.

Convolutional Neural Networks (CNNs) are very popular in many fields, including computer vision, speech recognition, and natural language processing, to name a few. Though deep learning leads to groundbreaking performance in these domains, the networks used are computationally very demanding and are far from real-time even on a GPU, which is not power efficient and therefore does not suit low-power systems such as mobile devices. To overcome this challenge, several solutions have been proposed for quantizing the weights and activations of these networks, which accelerates the runtime significantly. Yet this acceleration comes at the cost of a larger error. The method proposed in this work trains quantized neural networks by noise injection and a learned clamping, which improve the accuracy. This leads to state-of-the-art results on various regression and classification tasks, e.g., ImageNet classification with architectures such as ResNet-18/34/50 with as low as 3-bit weights and activations. We implement the proposed solution on an FPGA to demonstrate its applicability for low-power real-time applications. The implementation of the paper is available at https://github.com/Lancer555/NICE.
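
To make the two ingredients named in the abstract more concrete, here is a minimal sketch in plain Python. It is not taken from the paper or its repository: the function names, the unsigned range [0, clamp_max], and the choice of uniform noise one quantization step wide as a training-time stand-in for rounding are all assumptions made for illustration. The clamp bound is passed in as a plain parameter; in a learned-clamping scheme it would presumably be updated by gradient descent along with the weights, which this sketch does not implement.

```python
import random


def clamp(x, lo, hi):
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def quantize_uniform(x, clamp_max, bits):
    """Clamp x to [0, clamp_max], then round to one of 2**bits uniform levels.

    clamp_max is a hypothetical clamping parameter (assumed learnable elsewhere).
    """
    step = clamp_max / (2 ** bits - 1)  # distance between adjacent levels
    return round(clamp(x, 0.0, clamp_max) / step) * step


def inject_noise(x, clamp_max, bits, rng):
    """Training-time surrogate (assumption): replace rounding with additive
    uniform noise spanning one quantization step, then clamp."""
    step = clamp_max / (2 ** bits - 1)
    noisy = x + rng.uniform(-step / 2, step / 2)
    return clamp(noisy, 0.0, clamp_max)


if __name__ == "__main__":
    rng = random.Random(0)
    x = 0.4
    print("3-bit quantized:", quantize_uniform(x, clamp_max=1.0, bits=3))
    print("noise-injected: ", inject_noise(x, clamp_max=1.0, bits=3, rng=rng))
```

With 3 bits the quantizer has 8 levels, so the noise in this sketch never moves a value by more than half of the 1/7 step, which is the same bound as the rounding error it stands in for.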

research
11/27/2018

Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware

Convolutional Neural Networks (CNN) has become more popular choice for v...
research
07/18/2020

DeepDive: An Integrative Algorithm/Architecture Co-Design for Deep Separable Convolutional Neural Networks

Deep Separable Convolutional Neural Networks (DSCNNs) have become the em...
research
08/22/2018

An Overview of Datatype Quantization Techniques for Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are becoming increasingly popular d...
research
12/18/2017

Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA

Deep convolutional neural networks (CNN) based solutions are the current...
research
04/02/2021

Inference of Recyclable Objects with Convolutional Neural Networks

Population growth in the last decades has resulted in the production of ...
research
04/09/2021

Rock Hunting With Martian Machine Vision

The Mars Perseverance rover applies computer vision for navigation and h...
research
10/18/2021

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

In the low-bit quantization field, training Binary Neural Networks (BNNs...
