Towards Resolving the Challenge of Long-tail Distribution in UAV Images for Object Detection

November 07, 2020 ยท Entered Twilight ยท ๐Ÿ› IEEE Workshop/Winter Conference on Applications of Computer Vision

๐ŸŒ… TWILIGHT: Old Age
Predates the code-sharing era โ€” a pioneer of its time

"Last commit was 5.0 years ago (โ‰ฅ5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .dev_scripts, .github, .gitignore, .pre-commit-config.yaml, .readthedocs.yml, LICENSE, README.md, configs, demo, docker, docs, dshnet_0000116_00351_d_0000083.jpg, fig2.png, mmdet, pytest.ini, requirements.txt, requirements, resources, setup.cfg, setup.py, tests, tools

Authors Weiping Yu, Taojiannan Yang, Chen Chen arXiv ID 2011.03822 Category cs.CV: Computer Vision Cross-listed cs.AI Citations 108 Venue IEEE Workshop/Winter Conference on Applications of Computer Vision Repository https://github.com/we1pingyu/DSHNet โญ 54 Last Checked 1 month ago
Abstract
Existing methods for object detection in UAV images ignored an important challenge - imbalanced class distribution in UAV images - which leads to poor performance on tail classes. We systematically investigate existing solutions to long-tail problems and unveil that re-balancing methods that are effective on natural image datasets cannot be trivially applied to UAV datasets. To this end, we rethink long-tailed object detection in UAV images and propose the Dual Sampler and Head detection Network (DSHNet), which is the first work that aims to resolve long-tail distribution in UAV images. The key components in DSHNet include Class-Biased Samplers (CBS) and Bilateral Box Heads (BBH), which are developed to cope with tail classes and head classes in a dual-path manner. Without bells and whistles, DSHNet significantly boosts the performance of tail classes on different detection frameworks. Moreover, DSHNet significantly outperforms base detectors and generic approaches for long-tail problems on VisDrone and UAVDT datasets. It achieves new state-of-the-art performance when combining with image cropping methods. Code is available at https://github.com/we1pingyu/DSHNet
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision