Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN

May 03, 2023 ยท Entered Twilight ยท ๐Ÿ› IEEE International Conference on Robotics and Automation

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

"No code URL or promise found in abstract"
"Code repo scraped from project page (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: .circleci, .clang-format, .flake8, .github, .gitignore, .gitmodules, GETTING_STARTED.md, INSTALL.md, LICENSE, MODEL_ZOO.md, README.md, benchmarks, cocoapi, configs, datasets, demo, detectron2, dev, docker, docs, projects, setup.cfg, setup.py, tests, tools

Authors YuXuan Liu, Nikhil Mishra, Pieter Abbeel, Xi Chen arXiv ID 2305.01910 Category cs.CV: Computer Vision Cross-listed cs.AI, cs.RO Citations 4 Venue IEEE International Conference on Robotics and Automation Repository https://github.com/wyndwarrior/latent-maskrcnn โญ 4 Last Checked 6 days ago
Abstract
Object recognition and instance segmentation are fundamental skills in any robotic or autonomous system. Existing state-of-the-art methods are often unable to capture meaningful uncertainty in challenging or ambiguous scenes, and as such can cause critical errors in high-performance applications. In this paper, we explore a class of distributional instance segmentation models using latent codes that can model uncertainty over plausible hypotheses of object masks. For robotic picking applications, we propose a confidence mask method to achieve the high precision necessary in industrial use cases. We show that our method can significantly reduce critical errors in robotic systems, including our newly released dataset of ambiguous scenes in a robotic application. On a real-world apparel-picking robot, our method significantly reduces double pick errors while maintaining high performance.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computer Vision