Stacked U-Nets: A No-Frills Approach to Natural Image Segmentation

April 27, 2018 ยท Entered Twilight ยท ๐Ÿ› arXiv.org

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 7.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, __init__.py, config.json, display.py, evaluate_pascal.py, ptsemseg, requirements.txt, test_multiscale.py, train_imagenet.py, train_seg.py, viz_net_pytorch.py

Authors Sohil Shah, Pallabi Ghosh, Larry S Davis, Tom Goldstein arXiv ID 1804.10343 Category cs.CV: Computer Vision Cross-listed cs.NE Citations 58 Venue arXiv.org Repository https://github.com/shahsohil/sunets โญ 153 Last Checked 1 month ago
Abstract
Many imaging tasks require global information about all pixels in an image. Conventional bottom-up classification networks globalize information by decreasing resolution; features are pooled and downsampled into a single output. But for semantic segmentation and object detection tasks, a network must provide higher-resolution pixel-level outputs. To globalize information while preserving resolution, many researchers propose the inclusion of sophisticated auxiliary blocks, but these come at the cost of a considerable increase in network size and computational cost. This paper proposes stacked u-nets (SUNets), which iteratively combine features from different resolution scales while maintaining resolution. SUNets leverage the information globalization power of u-nets in a deeper network architectures that is capable of handling the complexity of natural images. SUNets perform extremely well on semantic segmentation tasks using a small number of parameters.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision