Paper Reading AI Learner

Data-Efficient Sequence-Based Visual Place Recognition with Highly Compressed JPEG Images

2023-02-26 13:13:51
Mihnea-Alexandru Tomita, Bruno Ferrarini, Michael Milford, Klaus McDonald-Maier, Shoaib Ehsan

Abstract

Visual Place Recognition (VPR) is a fundamental task that allows a robotic platform to successfully localise itself in the environment. For decentralised VPR applications where the visual data has to be transmitted between several agents, the communication channel may restrict the localisation process when limited bandwidth is available. JPEG is an image compression standard that can employ high compression ratios to facilitate lower data transmission for VPR applications. However, when applying high levels of JPEG compression, both the image clarity and size are drastically reduced. In this paper, we incorporate sequence-based filtering in a number of well-established, learnt and non-learnt VPR techniques to overcome the performance loss resulted from introducing high levels of JPEG compression. The sequence length that enables 100% place matching performance is reported and an analysis of the amount of data required for each VPR technique to perform the transfer on the entire spectrum of JPEG compression is provided. Moreover, the time required by each VPR technique to perform place matching is investigated, on both uniformly and non-uniformly JPEG compressed data. The results show that it is beneficial to use a highly compressed JPEG dataset with an increased sequence length, as similar levels of VPR performance are reported at a significantly reduced bandwidth. The results presented in this paper also emphasize that there is a trade-off between the amount of data transferred and the total time required to perform VPR. Our experiments also suggest that is often favourable to compress the query images to the same quality of the map, as more efficient place matching can be performed. The experiments are conducted on several VPR datasets, under mild to extreme JPEG compression.

Abstract (translated)

视觉位置识别(VPR)是使机器人平台在环境中成功定位的基本任务。对于分散的VPR应用,视觉数据需要在多个代理之间传输,当有限的带宽可用时,通信通道可能会限制定位过程。JPEG是一种图像压缩标准,可以使用高压缩比来促进VPR应用较低的数据传输。然而,当应用采用高版本的JPEG压缩时,图像清晰度和大小都急剧下降。在本文中,我们将序列过滤纳入许多稳定且非学习的VPR技术中,以克服引入高版本的JPEG压缩所带来的性能损失。报道了能够实现100%位置匹配性能的序列长度,并对每个VPR技术在JPEG压缩光谱范围内的数据传输所需的数据量进行了分析。此外,我们还研究了每个VPR技术进行位置匹配所需的时间,在均匀和非均匀JPEG压缩的数据上进行了研究。结果表明,使用高压缩度的JPEG数据集和增加序列长度的选项 beneficial,因为相似的VPR性能在带宽显著减少的情况下出现。本文还强调,数据传输量与VPR总时间之间的权衡存在。我们的实验还表明,通常将查询图像压缩到地图质量相同的水平有利于进行更高效的位置匹配。在轻微的到极端的JPEG压缩条件下,对多个VPR数据集进行了实验。

URL

https://arxiv.org/abs/2302.13314

PDF

https://arxiv.org/pdf/2302.13314.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot