EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

05/28/2019
by   Mingxing Tan, et al.
1

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called EfficientNets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.4 top-1 / 97.1 faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7 Flowers (98.8 magnitude fewer parameters. Source code is at https://github.com/tensorflow/tpu/tree/master/models/official/efficientnet.

READ FULL TEXT
research
11/08/2020

Analysis of Dimensional Influence of Convolutional Neural Networks for Histopathological Cancer Classification

Convolutional Neural Networks can be designed with different levels of c...
research
11/30/2020

SplitNet: Divide and Co-training

The width of a neural network matters since increasing the width will ne...
research
11/16/2020

Scaled-YOLOv4: Scaling Cross Stage Partial Network

We show that the YOLOv4 object detection neural network based on the CSP...
research
09/16/2020

EfficientNet-eLite: Extremely Lightweight and Efficient CNN Models for Edge Devices by Network Candidate Search

Embedding Convolutional Neural Network (CNN) into edge devices for infer...
research
04/01/2021

EfficientNetV2: Smaller Models and Faster Training

This paper introduces EfficientNetV2, a new family of convolutional netw...
research
07/31/2021

Greedy Network Enlarging

Recent studies on deep convolutional neural networks present a simple pa...
research
03/30/2020

TResNet: High Performance GPU-Dedicated Architecture

Many deep learning models, developed in recent years, reach higher Image...

Please sign up or login with your details

Forgot password? Click here to reset