Dynamic gesture retrieval: searching videos by human pose sequence

Abstract
Abstract (translated)
URL
PDF

Abstract

The number of static human poses is limited, it is hard to retrieve the exact videos using one single pose as the clue. However, with a pose sequence or a dynamic gesture as the keyword, retrieving specific videos becomes more feasible. We propose a novel method for querying videos containing a designated sequence of human poses, whereas previous works only designate a single static pose. The proposed method takes continuous 3d human poses from keyword gesture video and video candidates, then converts each pose in individual frames into bone direction descriptors, which describe the direction of each natural connection in articulated pose. A temporal pyramid sliding window is then applied to find matches between designated gesture and video candidates, which ensures that same gestures with different duration can be matched.

Abstract (translated)

URL

https://arxiv.org/abs/2006.07604

PDF

https://arxiv.org/pdf/2006.07604.pdf