
Spatio-Temporal Attention Network for Persistent Monitoring of Multiple Mobile Targets

2023-03-11 08:53:37
Yizhuo Wang, Yutong Wang, Yuhong Cao, Guillaume Sartoretti

Abstract

This work focuses on the persistent monitoring problem, where a set of targets moving based on an unknown model must be monitored by an autonomous mobile robot with a limited sensing range. To keep each target's position estimate as accurate as possible, the robot needs to adaptively plan its path to (re-)visit all the targets and update its belief from measurements collected along the way. In doing so, the main challenge is to strike a balance between exploitation, i.e., re-visiting previously-located targets, and exploration, i.e., finding new targets or re-acquiring lost ones. Encouraged by recent advances in deep reinforcement learning, we introduce an attention-based neural solution to the persistent monitoring problem, where the agent can learn the inter-dependencies between targets, i.e., their spatial and temporal correlations, conditioned on past measurements. This endows the agent with the ability to determine which target, time, and location to attend to across multiple scales, which we show also helps relax the usual limitations of a finite target set. We experimentally demonstrate that our method outperforms other baselines in terms of number of target visits and average estimation error in complex environments. Finally, we implement and validate our model in a drone-based simulation experiment to monitor mobile ground targets in a high-fidelity simulator.
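The abstract does not spell out the network itself, so the following is only a minimal sketch of generic scaled dot-product attention over a handful of targets, not the authors' architecture. The feature layout (distance to the robot, time since last observation), the hand-set query, and the function names `softmax` and `attend` are all assumptions made for illustration; in a learned model the queries, keys, and values would come from trained projections rather than fixed numbers.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector.

    query:  list of d floats (here, a stand-in for the agent's state).
    keys:   one list of d floats per target.
    values: one list of floats per target (same count as keys).
    Returns (weights, context); weights sum to 1 across targets.
    """
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in keys
    ]
    weights = softmax(scores)
    width = len(values[0])
    context = [
        sum(w * value[i] for w, value in zip(weights, values))
        for i in range(width)
    ]
    return weights, context


# Hypothetical per-target features: [distance to robot, time since last seen].
targets = [
    [0.2, 0.1],
    [0.5, 0.9],
    [0.9, 0.4],
]

# Illustrative query that only weighs "time since last seen".
query = [0.0, 1.0]

weights, context = attend(query, targets, targets)
```

With this hand-set query, the target that has gone unobserved the longest receives the largest weight, which loosely mirrors the exploitation side of the trade-off described in the abstract. A learned query could instead balance distance, staleness, and unexplored regions.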


URL

https://arxiv.org/abs/2303.06350

PDF

https://arxiv.org/pdf/2303.06350.pdf

