Paper Reading AI Learner

Benchmarking the Robustness of UAV Tracking Against Common Corruptions

2024-03-18 02:39:21
Xiaoqiong Liu, Yunhe Feng, Shu Hu, Xiaohui Yuan, Heng Fan

Abstract

The robustness of unmanned aerial vehicle (UAV) tracking is crucial in many tasks like surveillance and robotics. Despite its importance, little attention is paid to the performance of UAV trackers under common corruptions due to lack of a dedicated platform. Addressing this, we propose UAV-C, a large-scale benchmark for assessing robustness of UAV trackers under common corruptions. Specifically, UAV-C is built upon two popular UAV datasets by introducing 18 common corruptions from 4 representative categories including adversarial, sensor, blur, and composite corruptions in different levels. Finally, UAV-C contains more than 10K sequences. To understand the robustness of existing UAV trackers against corruptions, we extensively evaluate 12 representative algorithms on UAV-C. Our study reveals several key findings: 1) Current trackers are vulnerable to corruptions, indicating more attention needed in enhancing the robustness of UAV trackers; 2) When accompanying together, composite corruptions result in more severe degradation to trackers; and 3) While each tracker has its unique performance profile, some trackers may be more sensitive to specific corruptions. By releasing UAV-C, we hope it, along with comprehensive analysis, serves as a valuable resource for advancing the robustness of UAV tracking against corruption. Our UAV-C will be available at this https URL.

Abstract (translated)

无人机(UAV)跟踪的稳健性在许多任务中至关重要,如监视和机器人技术。尽管稳健性非常重要,但很少关注在普通污损下无人机跟踪器的性能。为了解决这个问题,我们提出了UAV-C,一个评估无人机跟踪器在常见污损下的稳健性的大型基准。具体来说,UAV-C基于两个流行的UAV数据集,引入了包括恶意、传感器、模糊和组合污损在内的18种常见污损水平。最后,UAV-C包含超过10K个序列。为了了解现有UAV跟踪器对污损的稳健性,我们详细评估了UAV-C上的12个代表算法。我们的研究揭示了几个关键发现:1)当前的跟踪器对污损非常脆弱,表明需要更多地关注增强UAV跟踪器的稳健性;2)当一起出现时,组合污损会导致跟踪器性能的更严重恶化;3)虽然每个跟踪器都有其独特的性能概况,但有些跟踪器可能对特定污损更加敏感。通过发布UAV-C,我们希望它,再加上全面的分析,成为提高无人机跟踪器对抗污损的有价值的资源。我们的UAV-C将在此链接上可用:https://www.url。

URL

https://arxiv.org/abs/2403.11424

PDF

https://arxiv.org/pdf/2403.11424.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot