Background: Distributed training is essential for large-scale training o...
Power consumption is a major obstacle in the deployment of deep neural n...
Quantization of the weights and activations is one of the main methods t...
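Since this abstract is truncated, the following is only a minimal sketch of what weight/activation quantization typically looks like; the symmetric k-bit scheme, the name `quantize`, and all parameter choices are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

# Hedged sketch: symmetric uniform ("fake") quantization of a tensor.
# Bit-width, rounding, and scale choice are assumptions for illustration;
# the abstract above does not specify the exact scheme.
def quantize(w, bits=8):
    qmax = 2 ** (bits - 1) - 1          # largest representable integer level
    scale = np.abs(w).max() / qmax      # map the largest magnitude to qmax
    q = np.clip(np.round(w / scale), -qmax, qmax)  # integer quantization levels
    return q * scale                    # dequantize back to float for use

w = np.random.randn(4, 4).astype(np.float32)
print(np.abs(w - quantize(w, bits=4)).max())  # error shrinks as bits grow
```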
Background: Catastrophic forgetting is the notorious vulnerability of ne...
Neural gradient compression remains a major bottleneck in improving train...
Background: Recently, extensive research has focused o...
Background: Recent developments have made it possible to accelerate neur...
Large-batch SGD is important for scaling training of deep neural network...
Unlike traditional approaches that focus on quantization at the netw...
Quantized Neural Networks (QNNs) are often used to improve network effic...
We suggest a novel approach for the estimation of the posterior distribu...
Over the past few years, batch normalization has been commonly used in de...
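For reference alongside the abstract above, here is a minimal sketch of the standard batch-normalization forward pass (training mode, fully-connected case). The per-feature statistics and affine parameters gamma/beta follow the usual batch-norm formulation; the function name and test values are illustrative assumptions.

```python
import numpy as np

# Sketch of the batch-norm forward pass: normalize each feature over the
# batch, then apply a learnable affine transform (gamma, beta).
def batch_norm(x, gamma, beta, eps=1e-5):
    mu = x.mean(axis=0)                    # per-feature batch mean
    var = x.var(axis=0)                    # per-feature batch variance
    x_hat = (x - mu) / np.sqrt(var + eps)  # zero mean, unit variance
    return gamma * x_hat + beta            # restore scale and shift

x = np.random.randn(32, 8)                 # batch of 32 samples, 8 features
y = batch_norm(x, gamma=np.ones(8), beta=np.zeros(8))
print(y.mean(axis=0).round(6), y.std(axis=0).round(3))  # ~0 mean, ~1 std
```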
Deep convolutional networks have been the state-of-the-art approach for a ...
Neural networks are commonly used as classification models for a wid...
We show that gradient descent on an unregularized logistic regression pr...
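The claim in this abstract can be illustrated numerically: on linearly separable data, plain gradient descent on the unregularized logistic loss grows the weight norm without bound while the weight direction stabilizes. The data, step size, and iteration count below are illustrative assumptions, not the paper's setup.

```python
import numpy as np
from scipy.special import expit  # numerically stable sigmoid

rng = np.random.default_rng(0)
# Two well-separated Gaussian blobs -> linearly separable labels +/-1.
X = np.vstack([rng.normal(2.0, 0.5, (50, 2)), rng.normal(-2.0, 0.5, (50, 2))])
y = np.hstack([np.ones(50), -np.ones(50)])

w = np.zeros(2)
for _ in range(20000):
    margins = y * (X @ w)
    # Gradient of mean log(1 + exp(-y * x.w)) with respect to w.
    grad = -(X * (y * expit(-margins))[:, None]).mean(axis=0)
    w -= 0.1 * grad

print("||w|| =", np.linalg.norm(w))        # keeps growing with more steps
print("w/||w|| =", w / np.linalg.norm(w))  # direction converges
```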
Background: Deep learning models are typically trained using stochastic ...
Background: Statistical mechanics results (Dauphin et al. (2014); Chorom...
Convolutional networks have established themselves over the last few years a...
Deep learning has proven to be a successful set of models for learni...