paper brief

EfficientNet (arXiv:1905.11946) — Compound Scaling for Convolutional Neural Networks

EfficientNet studies how to scale convolutional neural networks when more compute is available, rather than designing models only for a fixed resource budget. The paper reports that carefully balancing network depth, width, and resolution yields better performance, and it introduces a compound coefficient that uniformly scales these three dimensions.

February 25, 2026•Mira Vale•ml foundations

Continue in Rorobot with the source paper open and ready for chat.

Open this paper in Rorobot

What this paper is about

Convolutional neural networks are commonly developed at a fixed resource budget, and then scaled up for better accuracy when more resources are available.[S1] This paper reports a systematic study of model scaling and states that carefully balancing network depth, width, and resolution can lead to better performance.[S1] Based on that observation, the paper proposes a scaling method that uniformly scales depth, width, and resolution using a simple compound coefficient.[S1] The paper demonstrates the effectiveness of this compound scaling method by scaling up MobileNets and ResNet models.[S1] The paper also reports using neural architecture search to design a new baseline network and then scaling that baseline to obtain a family of models called EfficientNets.[S1] The paper states that the EfficientNet family achieves much better accuracy and efficiency than previous convolutional networks.[S1] The paper reports that EfficientNet-B7 achieves state-of-the-art 84.3% top-1 accuracy.[S1]

Core claims to remember

The paper states that convolutional networks are often built for a fixed resource budget and are later scaled up if more resources become available.[S1] The paper reports that model scaling can be studied systematically rather than treated as an ad hoc step after a model is designed.[S1] The paper reports identifying that carefully balancing depth, width, and resolution leads to better performance.[S1] The paper proposes a compound scaling method that uniformly scales all three of depth, width, and resolution using a simple coefficient.[S1] The paper demonstrates the method by applying it to scale up MobileNets and ResNet architectures.[S1] The paper reports using neural architecture search to produce a new baseline network and then scaling it to create a family of EfficientNet models.[S1] The paper states that these EfficientNets deliver much better accuracy and efficiency than previous convolutional networks.[S1] The paper reports that EfficientNet-B7 reaches 84.3% top-1 accuracy and describes this as state of the art.[S1]

Limitations and caveats

The paper’s demonstrations of the proposed compound scaling method include scaling up MobileNets and ResNet models.[S1] The paper also reports a second step that uses neural architecture search to design a new baseline network before scaling it into the EfficientNet family.[S1] The paper reports a specific top-line result for EfficientNet-B7 of 84.3% top-1 accuracy.[S1] The paper presents its contribution in the context of scaling convolutional networks across depth, width, and resolution.[S1]

How to apply this in study or projects

Read the paper’s description of the common workflow where convolutional networks are developed at a fixed resource budget and then scaled up when more resources are available.[S1] Trace the paper’s systematic study of model scaling and the reported finding that balancing depth, width, and resolution can improve performance.[S1] Write down the paper’s definition of its compound scaling method as uniformly scaling depth, width, and resolution using a simple coefficient.[S1] Follow the paper’s reported demonstrations by examining how the compound scaling method is applied to scale up MobileNets and ResNet.[S1] Compare the paper’s two routes to larger models by separating the scaling of existing architectures from the route that uses neural architecture search to design a new baseline and then scales it into the EfficientNet family.[S1] Record the paper’s reported headline metric for EfficientNet-B7, including the stated 84.3% top-1 accuracy and the paper’s characterization of it as state of the art.[S1]

Sources

[S1]arxiv.org
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called EfficientNets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.3% top-1 accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7%), Flowers (98.8%), and 3 other transfer learning datasets, with an order of magnitude fewer parameters. Source code is at https://github.com/tensorflow/tpu/tree/master/models/official/efficientnet.
Open source Back to article

FAQ

What is the main idea behind EfficientNet scaling?

The paper reports that carefully balancing network depth, width, and resolution leads to better performance when scaling convolutional neural networks.[S1] The paper proposes a compound scaling method that uniformly scales depth, width, and resolution using a simple coefficient.[S1]

What results does the paper report for EfficientNet-B7?

The paper reports that EfficientNet-B7 achieves state-of-the-art 84.3% top-1 accuracy.[S1] The paper presents EfficientNet-B7 as part of a family of models obtained by designing a baseline network with neural architecture search and then scaling it up.[S1]