Non-local Neural Networks
November 21, 2017 · Entered Twilight · 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
"Last commit was 7.0 years ago (≥5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: CODE_OF_CONDUCT.md, CONTRIBUTING.md, DATASET.md, INSTALL.md, LICENSE, README.md, caffe2_customized_ops, configs, data, lib, process_data, scripts, tools
Authors
Xiaolong Wang, Ross Girshick, Abhinav Gupta, Kaiming He
arXiv ID
1711.07971
Category
cs.CV: Computer Vision
Citations
9.9K
Venue
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Repository
https://github.com/facebookresearch/video-nonlocal-net
⭐ 1996
Last Checked
1 month ago
Abstract
Both convolutional and recurrent operations are building blocks that process one local neighborhood at a time. In this paper, we present non-local operations as a generic family of building blocks for capturing long-range dependencies. Inspired by the classical non-local means method in computer vision, our non-local operation computes the response at a position as a weighted sum of the features at all positions. This building block can be plugged into many computer vision architectures. On the task of video classification, even without any bells and whistles, our non-local models can compete or outperform current competition winners on both Kinetics and Charades datasets. In static image recognition, our non-local models improve object detection/segmentation and pose estimation on the COCO suite of tasks. Code is available at https://github.com/facebookresearch/video-nonlocal-net .
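The abstract describes the non-local operation as computing the response at a position as a weighted sum of the features at all positions. The following is a minimal sketch of that idea in plain Python, not code from the linked repository. The function name `non_local_response`, the dot-product similarity, and the softmax normalization are assumptions chosen for illustration; the abstract itself does not specify how the weights are computed.

```python
import math

def non_local_response(features):
    """features: list of N feature vectors (one per position), each of length C.

    Hypothetical helper: returns one response vector per position.
    """
    outputs = []
    for x_i in features:
        # Similarity of position i to every position j (assumed: dot product).
        scores = [sum(a * b for a, b in zip(x_i, x_j)) for x_j in features]
        # Softmax over all positions, shifted by the max for numerical stability.
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Response at position i: weighted sum of the features at all positions.
        y_i = [sum(w * x_j[c] for w, x_j in zip(weights, features))
               for c in range(len(x_i))]
        outputs.append(y_i)
    return outputs

# Three positions with two channels each (made-up values).
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
responses = non_local_response(features)
print(len(responses), len(responses[0]))
```

Because every position attends to every other position, the cost grows quadratically with the number of positions, which is the price paid for capturing long-range dependencies in a single step.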
Similar Papers
In the same crypt: Computer Vision
Old Age
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
👻
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
Old Age
SSD: Single Shot MultiBox Detector
Old Age
Squeeze-and-Excitation Networks
R.I.P.
👻
Ghosted