Paper Reading AI Learner

MotionTrack: Learning Robust Short-term and Long-term Motions for Multi-Object Tracking

2023-03-18 12:38:33
Zheng Qin, Sanping Zhou, Le Wang, Jinghai Duan, Gang Hua, Wei Tang

Abstract

The main challenge of Multi-Object Tracking~(MOT) lies in maintaining a continuous trajectory for each target. Existing methods often learn reliable motion patterns to match the same target between adjacent frames and discriminative appearance features to re-identify the lost targets after a long period. However, the reliability of motion prediction and the discriminability of appearances can be easily hurt by dense crowds and extreme occlusions in the tracking process. In this paper, we propose a simple yet effective multi-object tracker, i.e., MotionTrack, which learns robust short-term and long-term motions in a unified framework to associate trajectories from a short to long range. For dense crowds, we design a novel Interaction Module to learn interaction-aware motions from short-term trajectories, which can estimate the complex movement of each target. For extreme occlusions, we build a novel Refind Module to learn reliable long-term motions from the target's history trajectory, which can link the interrupted trajectory with its corresponding detection. Our Interaction Module and Refind Module are embedded in the well-known tracking-by-detection paradigm, which can work in tandem to maintain superior performance. Extensive experimental results on MOT17 and MOT20 datasets demonstrate the superiority of our approach in challenging scenarios, and it achieves state-of-the-art performances at various MOT metrics.

Abstract (translated)

多目标跟踪~(MOT)的主要挑战在于维持每个目标持续的跟踪轨迹。现有的方法通常学习可靠的运动模式,在相邻帧之间匹配相同的目标,并在长时间内识别丢失的目标。然而,在跟踪过程中,密集的人群和极端的遮挡可能会伤害运动预测的可靠性和外貌区分的精度。在本文中,我们提出了一种简单但有效的多目标跟踪器,即 MotionTrack,它在一个统一框架中学习可靠的短期和长期运动,将轨迹从短到长range联系起来。对于密集的人群,我们设计了一个新相互作用模块,从短期轨迹中学习相互作用 aware 的运动,可以估计每个目标的复杂运动。对于极端遮挡,我们建立了一个新找回模块,从目标的历史轨迹中学习可靠的长期运动,可以链接中断的轨迹与相应的检测。我们的相互作用模块和找回模块嵌入了著名的跟踪-检测范式,可以协同工作以保持更好的性能。在 MOT17 和 MOT20 数据集上的广泛实验结果表明,我们在挑战性场景中的优越性,并且它在不同 MOT 指标上实现了最先进的性能。

URL

https://arxiv.org/abs/2303.10404

PDF

https://arxiv.org/pdf/2303.10404.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot