Robust RGB-based 6-DoF Pose Estimation without Real Pose Annotations

2020-08-19 12:07:01

Zhigang Li, Yinlin Hu, Mathieu Salzmann, Xiangyang Ji

arXiv_CV

Abstract
Abstract (translated)
URL
PDF

Abstract

While much progress has been made in 6-DoF object pose estimation from a single RGB image, the current leading approaches heavily rely on real-annotation data. As such, they remain sensitive to severe occlusions, because covering all possible occlusions with annotated data is intractable. In this paper, we introduce an approach to robustly and accurately estimate the 6-DoF pose in challenging conditions and without using any real pose annotations. To this end, we leverage the intuition that the poses predicted by a network from an image and from its counterpart synthetically altered to mimic occlusion should be consistent, and translate this to a self-supervised loss function. Our experiments on LINEMOD, Occluded-LINEMOD, YCB and new Randomization LINEMOD dataset evidence the robustness of our approach. We achieve state of the art performance on LINEMOD, and OccludedLINEMOD in without real-pose setting, even outperforming methods that rely on real annotations during training on Occluded-LINEMOD.

Abstract (translated)

URL

https://arxiv.org/abs/2008.08391

PDF

https://arxiv.org/pdf/2008.08391.pdf