End-to-End 3D Hand Pose Estimation from Stereo Cameras

2022-06-03 04:18:58

Yuncheng Li, Zehao Xue, Yingying Wang, Liuhao Ge, Zhou Ren, Jonathan Rodriguez

arXiv_CV


Abstract

This work proposes an end-to-end approach to estimating full 3D hand pose from stereo cameras. Most existing methods for estimating hand pose from stereo cameras apply stereo matching to obtain a depth map and then use a depth-based solution to estimate the hand pose. In contrast, we propose to bypass stereo matching and estimate the 3D hand pose directly from the stereo image pairs. The proposed neural network architecture extends any keypoint predictor to estimate the sparse disparity of the hand joints. To train the model effectively, we propose a large-scale synthetic dataset composed of stereo image pairs and ground-truth 3D hand pose annotations. Experiments show that the proposed approach outperforms existing methods based on stereo depth.
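
To illustrate why per-joint (sparse) disparity is enough to recover a 3D pose, the sketch below lifts 2D keypoints and their disparities to 3D using standard rectified-stereo geometry (Z = f * B / d). It is not the authors' implementation: the function name `joints_from_disparity`, the pinhole intrinsics (`fx`, `fy`, `cx`, `cy`), the `baseline` parameter, and the assumption of a rectified camera pair are all illustrative choices not stated in the abstract.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def joints_from_disparity(
    keypoints: Sequence[Tuple[float, float]],
    disparities: Sequence[float],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    baseline: float,
) -> List[Point3D]:
    """Lift 2D joints in the left image to 3D using per-joint disparity.

    Assumes (hypothetically) a rectified stereo pair with a pinhole model:
    keypoints are (u, v) pixels in the left image, disparities are in
    pixels, and baseline is in the unit wanted for the output (e.g. metres).
    """
    if len(keypoints) != len(disparities):
        raise ValueError("need exactly one disparity per keypoint")

    joints: List[Point3D] = []
    for (u, v), d in zip(keypoints, disparities):
        if d <= 0:
            # Zero or negative disparity has no finite depth in this model.
            raise ValueError("disparity must be positive")
        z = fx * baseline / d      # depth from disparity
        x = (u - cx) * z / fx      # back-project the pixel column
        y = (v - cy) * z / fy      # back-project the pixel row
        joints.append((x, y, z))
    return joints


if __name__ == "__main__":
    # Two made-up joints with made-up camera parameters.
    pose = joints_from_disparity(
        keypoints=[(320.0, 240.0), (420.0, 240.0)],
        disparities=[10.0, 20.0],
        fx=500.0, fy=500.0, cx=320.0, cy=240.0, baseline=0.1,
    )
    for joint in pose:
        print(joint)
```

Under these assumptions, only one disparity value per hand joint is needed, rather than a dense depth map for the whole image, which is the intuition behind bypassing stereo matching.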

URL

https://arxiv.org/abs/2206.01384

PDF

https://arxiv.org/pdf/2206.01384.pdf