Enhancing Person Re-Identification via Uncertainty Feature Fusion and Wise Distance Aggregation

2024-05-02 09:09:48
Quang-Huy Che, Le-Chuong Nguyen, Vinh-Tiep Nguyen

Abstract

Building robust person re-identification (Re-ID) systems that accurately identify subjects across diverse scenarios remains a formidable challenge in surveillance and security applications. This study presents a methodology that enhances Re-ID by integrating Uncertainty Feature Fusion (UFFM) with Wise Distance Aggregation (WDA). Tested on three benchmark datasets (Market-1501, DukeMTMC-ReID, and MSMT17), our approach demonstrates substantial improvements in Rank-1 accuracy and mean Average Precision (mAP). Specifically, UFFM synthesizes features from multiple images of a subject to overcome the limitations imposed by the variability of that subject's appearance across different views. WDA further refines the process by aggregating similarity metrics, thereby enhancing the system's ability to discern subtle but critical differences between subjects. The empirical results show that our method outperforms existing approaches, achieving new performance benchmarks across all evaluated datasets. Code is available on GitHub.
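The abstract does not spell out how UFFM or WDA are computed, so the following is only a minimal sketch of the general idea it describes: fusing features from several images, then aggregating more than one distance. Everything here is an assumption for illustration, not the authors' implementation: the function names, the k-nearest-neighbour averaging used for fusion, the cosine distance, and the parameters `k`, `alpha`, and `weight` are all hypothetical.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def cosine_distance(a, b):
    """Pairwise cosine distance between rows of a and rows of b."""
    return 1.0 - l2_normalize(a) @ l2_normalize(b).T


def fuse_with_neighbors(features, k=2, alpha=0.5):
    """Blend each feature with the mean of its k nearest neighbours.

    Hypothetical stand-in for multi-image feature fusion; the paper's
    actual fusion rule is not given in the abstract.
    """
    feats = l2_normalize(features)
    sim = feats @ feats.T
    np.fill_diagonal(sim, -np.inf)            # never pick a sample as its own neighbour
    nn_idx = np.argsort(-sim, axis=1)[:, :k]  # indices of the k most similar rows
    neighbor_mean = feats[nn_idx].mean(axis=1)
    return l2_normalize(alpha * feats + (1.0 - alpha) * neighbor_mean)


def aggregated_distance(query, gallery, k=2, alpha=0.5, weight=0.5):
    """Weighted sum of distances to the original and to the fused gallery features.

    Hypothetical stand-in for distance aggregation; the weighting scheme
    is an assumption.
    """
    d_orig = cosine_distance(query, gallery)
    d_fused = cosine_distance(query, fuse_with_neighbors(gallery, k, alpha))
    return weight * d_orig + (1.0 - weight) * d_fused


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    gallery = rng.normal(size=(6, 4))  # 6 gallery images, 4-dim toy features
    query = rng.normal(size=(2, 4))    # 2 query images
    dist = aggregated_distance(query, gallery)
    ranking = np.argsort(dist, axis=1)  # gallery indices, closest first
    print(ranking)
```

In this sketch, setting `weight=1.0` falls back to plain cosine distance on the original features, which makes it easy to compare rankings with and without the fused term.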

URL

https://arxiv.org/abs/2405.01101

PDF

https://arxiv.org/pdf/2405.01101.pdf

