FastMask: Segment Multi-scale Object Candidates in One Shot

December 28, 2016 ยท Entered Twilight ยท ๐Ÿ› Computer Vision and Pattern Recognition

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 8.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, .gitmodules, LICENSE, README.md, alchemy, caffe-fm, config.py, configs, demo.py, evalCOCO.py, image_demo.py, models, python_layers, redis.conf, requirements.txt, spiders, test.py, train.py, utils.py, video_demo.py

Authors Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha arXiv ID 1612.08843 Category cs.CV: Computer Vision Cross-listed cs.AI Citations 28 Venue Computer Vision and Pattern Recognition Repository https://github.com/voidrank/FastMask โญ 212 Last Checked 1 month ago
Abstract
Objects appear to scale differently in natural images. This fact requires methods dealing with object-centric tasks (e.g. object proposal) to have robust performance over variances in object scales. In the paper, we present a novel segment proposal framework, namely FastMask, which takes advantage of hierarchical features in deep convolutional neural networks to segment multi-scale objects in one shot. Innovatively, we adapt segment proposal network into three different functional components (body, neck and head). We further propose a weight-shared residual neck module as well as a scale-tolerant attentional head module for efficient one-shot inference. On MS COCO benchmark, the proposed FastMask outperforms all state-of-the-art segment proposal methods in average recall being 2~5 times faster. Moreover, with a slight trade-off in accuracy, FastMask can segment objects in near real time (~13 fps) with 800*600 resolution images, demonstrating its potential in practical applications. Our implementation is available on https://github.com/voidrank/FastMask.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision