Paper Reading AI Learner

GaitMorph: Transforming Gait by Optimally Transporting Discrete Codes

2023-07-27 09:09:28
Adrian Cosma, Emilian Radoi

Abstract

Gait, the manner of walking, has been proven to be a reliable biometric with uses in surveillance, marketing and security. A promising new direction for the field is training gait recognition systems without explicit human annotations, through self-supervised learning approaches. Such methods are heavily reliant on strong augmentations for the same walking sequence to induce more data variability and to simulate additional walking variations. Current data augmentation schemes are heuristic and cannot provide the necessary data variation as they are only able to provide simple temporal and spatial distortions. In this work, we propose GaitMorph, a novel method to modify the walking variation for an input gait sequence. Our method entails the training of a high-compression model for gait skeleton sequences that leverages unlabelled data to construct a discrete and interpretable latent space, which preserves identity-related features. Furthermore, we propose a method based on optimal transport theory to learn latent transport maps on the discrete codebook that morph gait sequences between variations. We perform extensive experiments and show that our method is suitable to synthesize additional views for an input sequence.

Abstract (translated)

步态(Gait)是步行的方式,已被证明是一种可靠的生物特征,可用于监视、营销和安保等领域。一个有前途的新方向是通过自监督学习方法训练输入步态序列的步态识别系统。这些方法 heavily reliant on strong augmentations for the same walking sequence to induce more data variability and to simulate additional walking variations. 当前的数据增强方案是启发式的,无法提供必要的数据变化,因为它们只能提供简单的时间和空间扭曲。在本研究中,我们提出了步态变形方法(Gait Morph),一种修改输入步态序列的新方法。我们的方法涉及训练一个高压缩的步态骨骼序列模型,利用未标记数据建立离散且可解释的潜在空间,以维持身份相关特征。此外,我们提出了基于最优传输理论的方法,学习离散代码book中的过渡传输映射,以morph步态序列之间的变化。我们进行了广泛的实验,并表明我们的方法适合合成输入序列的额外视角。

URL

https://arxiv.org/abs/2307.14713

PDF

https://arxiv.org/pdf/2307.14713.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot