PERCH 2.0 : Fast and Accurate GPU-based Perception via Search for Object Pose Estimation

August 01, 2020 · Entered Twilight · 🏛 IEEE/RJS International Conference on Intelligent RObots and Systems

"No code URL or promise found in abstract"
"Derived repo from GitHub Pages (backfill)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, LICENSE, README.md, _config.yml, cuda_icp, cuda_renderer, docker, images, index.html, kinect_sim, object_recognition_node, params.json, perception_utils, perch.rosinstall, sbpl_perception, stylesheets

Authors Aditya Agarwal, Yupeng Han, Maxim Likhachev arXiv ID 2008.00326 Category cs.CV: Computer Vision Cross-listed cs.RO Citations 7 Venue IEEE/RJS International Conference on Intelligent RObots and Systems Repository https://github.com/sbpl-cruz/perception ⭐ 16 Last Checked 8 days ago

Abstract

Pose estimation of known objects is fundamental to tasks such as robotic grasping and manipulation. The need for reliable grasping imposes stringent accuracy requirements on pose estimation in cluttered, occluded scenes in dynamic environments. Modern methods employ large sets of training data to learn features in order to find correspondence between 3D models and observed data. However these methods require extensive annotation of ground truth poses. An alternative is to use algorithms that search for the best explanation of the observed scene in a space of possible rendered scenes. A recently developed algorithm, PERCH (PErception Via SeaRCH) does so by using depth data to converge to a globally optimum solution using a search over a specially constructed tree. While PERCH offers strong guarantees on accuracy, the current formulation suffers from low scalability owing to its high runtime. In addition, the sole reliance on depth data for pose estimation restricts the algorithm to scenes where no two objects have the same shape. In this work, we propose PERCH 2.0, a novel perception via search strategy that takes advantage of GPU acceleration and RGB data. We show that our approach can achieve a speedup of 100x over PERCH, as well as better accuracy than the state-of-the-art data-driven approaches on 6-DoF pose estimation without the need for annotating ground truth poses in the training data. Our code and video are available at https://sbpl-cruz.github.io/perception/.