๐
๐
Old Age
DiffusionDet: Diffusion Model for Object Detection
November 17, 2022 ยท Entered Twilight ยท ๐ IEEE International Conference on Computer Vision
Repo contents: .gitignore, GETTING_STARTED.md, LICENSE, README.md, configs, demo.py, diffusiondet, teaser.png, train_net.py
Authors
Shoufa Chen, Peize Sun, Yibing Song, Ping Luo
arXiv ID
2211.09788
Category
cs.CV: Computer Vision
Citations
667
Venue
IEEE International Conference on Computer Vision
Repository
https://github.com/ShoufaChen/DiffusionDet
โญ 2243
Last Checked
1 month ago
Abstract
We propose DiffusionDet, a new framework that formulates object detection as a denoising diffusion process from noisy boxes to object boxes. During the training stage, object boxes diffuse from ground-truth boxes to random distribution, and the model learns to reverse this noising process. In inference, the model refines a set of randomly generated boxes to the output results in a progressive way. Our work possesses an appealing property of flexibility, which enables the dynamic number of boxes and iterative evaluation. The extensive experiments on the standard benchmarks show that DiffusionDet achieves favorable performance compared to previous well-established detectors. For example, DiffusionDet achieves 5.3 AP and 4.8 AP gains when evaluated with more boxes and iteration steps, under a zero-shot transfer setting from COCO to CrowdHuman. Our code is available at https://github.com/ShoufaChen/DiffusionDet.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Computer Vision
๐
๐
Old Age
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
R.I.P.
๐ป
Ghosted
You Only Look Once: Unified, Real-Time Object Detection
๐
๐
Old Age
SSD: Single Shot MultiBox Detector
๐
๐
Old Age
Squeeze-and-Excitation Networks
R.I.P.
๐ป
Ghosted