Paper Reading AI Learner

Frustratingly Easy Person Re-Identification: Generalizing Person Re-ID in Practice

2019-05-09 03:00:13
Jieru Jia, Qiuqi Ruan, Timothy M. Hospedales

Abstract

Contemporary person re-identification (Re-ID) methods usually require access to data from the deployment camera network during training in order to perform well. This is because contemporary Re-ID models trained on one dataset do not generalise to other camera networks due to the domain-shift between datasets. This requirement is often the bottleneck for deploying Re-ID systems in practical security or commercial applications as it may be impossible to collect this data in advance or prohibitively costly to annotate it. This paper alleviates this issue by proposing a simple baseline for domain generalizable~(DG) person re-identification. That is, to learn a Re-ID model from a set of source domains that is suitable for application to unseen datasets out-of-the-box, without any model updating. Specifically, we observe that the domain discrepancy in Re-ID is due to style and content variance across datasets and demonstrate appropriate Instance and Feature Normalization alleviates much of the resulting domain-shift in Deep Re-ID models. Instance Normalization~(IN) in early layers filters out style statistic variations and Feature Normalization~(FN) in deep layers is able to further eliminate disparity in content statistics. Compared to contemporary alternatives, this approach is extremely simple to implement, while being faster to train and test, thus making it an extremely valuable baseline for implementing Re-ID in practice. With a few lines of code, it increases the rank 1 Re-ID accuracy by 11.7\%, 28.9\%, 10.1\% and 6.3\% on the VIPeR, PRID, GRID, and i-LIDS benchmarks respectively. Source code will be made available.

Abstract (translated)

现代的人再识别(Re-ID)方法通常需要在训练期间访问部署摄像头网络中的数据,以便表现良好。这是因为在一个数据集上训练的现代RE-ID模型由于数据集之间的域转换而不通用于其他摄像机网络。这一需求通常是在实际安全或商业应用中部署REID系统的瓶颈,因为可能无法提前收集这些数据,或者注释这些数据的成本过高。本文通过提出一个简单的域可推广~(dg)人再识别基线,来缓解这一问题。也就是说,要从一组源域中学习REID模型,该源域适合应用程序在不更新任何模型的情况下从框中看不到数据集。具体来说,我们观察到REID中的域差异是由于数据集之间的样式和内容差异造成的,并且证明了适当的实例和特征规范化可以减轻深层REID模型中产生的域变化。实例归一化(in)可以滤除早期层次的风格统计变化,而深层层次的特征归一化(fn)可以进一步消除内容统计中的差异。与当代的替代方法相比,这种方法非常容易实现,同时训练和测试速度更快,因此在实践中实现REID成为非常有价值的基线。通过几行代码,它将Viper、Prid、Grid和I-Lids基准的排名1 RE ID精度分别提高了11.7%、28.9%、10.1%和6.3%。源代码将可用。

URL

https://arxiv.org/abs/1905.03422

PDF

https://arxiv.org/pdf/1905.03422.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot