Squeeze-and-Excitation Networks
September 05, 2017 ยท Entered Twilight ยท ๐ 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
"Last commit was 6.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .gitignore, ILSVRC2017_val.txt, LICENSE, README.md, figures, include, models, src
Authors
Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu
arXiv ID
1709.01507
Category
cs.CV: Computer Vision
Citations
32.3K
Venue
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Repository
https://github.com/hujie-frank/SENet
โญ 3610
Last Checked
1 month ago
Abstract
The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets. We further demonstrate that SE blocks bring significant improvements in performance for existing state-of-the-art CNNs at slight additional computational cost. Squeeze-and-Excitation Networks formed the foundation of our ILSVRC 2017 classification submission which won first place and reduced the top-5 error to 2.251%, surpassing the winning entry of 2016 by a relative improvement of ~25%. Models and code are available at https://github.com/hujie-frank/SENet.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computer Vision
๐
๐
Old Age
๐
๐
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
๐ป
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
๐
๐
Old Age
SSD: Single Shot MultiBox Detector
R.I.P.
๐ป
Ghosted
Rethinking the Inception Architecture for Computer Vision
๐
๐
Old Age