Single-Image Depth Prediction Makes Feature Matching Easier

2020-08-21 14:25:36

Carl Toft, Daniyar Turmukhambetov, Torsten Sattler, Fredrik Kahl, Gabriel Brostow

arXiv_CV

arXiv_CV Prediction Pose Action 3D Reconstruction Enhancement Matching

Abstract
Abstract (translated)
URL
PDF

Abstract

Good local features improve the robustness of many 3D re-localization and multi-view reconstruction pipelines. The problem is that viewing angle and distance severely impact the recognizability of a local feature. Attempts to improve appearance invariance by choosing better local feature points or by leveraging outside information, have come with pre-requisites that made some of them impractical. In this paper, we propose a surprisingly effective enhancement to local feature extraction, which improves matching. We show that CNN-based depths inferred from single RGB images are quite helpful, despite their flaws. They allow us to pre-warp images and rectify perspective distortions, to significantly enhance SIFT and BRISK features, enabling more good matches, even when cameras are looking at the same scene but in opposite directions.

Abstract (translated)

URL

https://arxiv.org/abs/2008.09497

PDF

https://arxiv.org/pdf/2008.09497.pdf