Stacked U-Nets: A No-Frills Approach to Natural Image Segmentation

April 27, 2018 · Entered Twilight · 🏛 arXiv.org

"Last commit was 7.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, __init__.py, config.json, display.py, evaluate_pascal.py, ptsemseg, requirements.txt, test_multiscale.py, train_imagenet.py, train_seg.py, viz_net_pytorch.py

Authors Sohil Shah, Pallabi Ghosh, Larry S Davis, Tom Goldstein arXiv ID 1804.10343 Category cs.CV: Computer Vision Cross-listed cs.NE Citations 58 Venue arXiv.org Repository https://github.com/shahsohil/sunets ⭐ 153 Last Checked 1 month ago

Abstract

Many imaging tasks require global information about all pixels in an image. Conventional bottom-up classification networks globalize information by decreasing resolution; features are pooled and downsampled into a single output. But for semantic segmentation and object detection tasks, a network must provide higher-resolution pixel-level outputs. To globalize information while preserving resolution, many researchers propose the inclusion of sophisticated auxiliary blocks, but these come at the cost of a considerable increase in network size and computational cost. This paper proposes stacked u-nets (SUNets), which iteratively combine features from different resolution scales while maintaining resolution. SUNets leverage the information globalization power of u-nets in a deeper network architectures that is capable of handling the complexity of natural images. SUNets perform extremely well on semantic segmentation tasks using a small number of parameters.