Abstract
Identifying humans with their walking sequences, known as gait recognition, is a useful biometric understanding task as it can be observed from a long distance and does not require cooperation from the subject. Two common modalities used for representing the walking sequence of a person are silhouettes and joint skeletons. Silhouette sequences, which record the boundary of the walking person in each frame, may suffer from the variant appearances from carried-on objects and clothes of the person. Framewise joint detections are noisy and introduce some jitters that are not consistent with sequential detections. In this paper, we combine the silhouettes and skeletons and refine the framewise joint predictions for gait recognition. With temporal information from the silhouette sequences. We show that the refined skeletons can improve gait recognition performance without extra annotations. We compare our methods on four public datasets, CASIA-B, OUMVLP, Gait3D and GREW, and show state-of-the-art performance.
Abstract (translated)
识别人类的步态序列,也称为步态识别,是一项有用的生物特征理解任务,因为它可以在远处观察并不需要 subject 的合作。用于表示一个人步态序列的常见模式有两种:轮廓和关节骨骼。轮廓序列在每个帧中记录步态人的边界,可能会受到携带物品和衣服的人所穿衣服的变化影响。帧间关节检测是噪声性的,并引入了与顺序检测不一致的一些抖动。在本文中,我们将轮廓和骨骼组合在一起,并优化帧间关节预测,以进行步态识别。利用轮廓序列的时序信息,我们表明,优化的骨骼可以在不需要额外的注释的情况下提高步态识别性能。我们比较了四个公共数据集:CASIA-B、OUMVLP、Gait3D 和 GREW,并展示了最先进的性能。
URL
https://arxiv.org/abs/2304.07916