Paper Reading AI Learner

BundledSLAM: An Accurate Visual SLAM System Using Multiple Cameras

2024-03-28 23:52:41
Han Song, Cong Liu, Huafeng Dai

Abstract

Multi-camera SLAM systems offer a plethora of advantages, primarily stemming from their capacity to amalgamate information from a broader field of view, thereby resulting in heightened robustness and improved localization accuracy. In this research, we present a significant extension and refinement of the state-of-the-art stereo SLAM system, known as ORB-SLAM2, with the objective of attaining even higher this http URL accomplish this objective, we commence by mapping measurements from all cameras onto a virtual camera termed BundledFrame. This virtual camera is meticulously engineered to seamlessly adapt to multi-camera configurations, facilitating the effective fusion of data captured from multiple cameras. Additionally, we harness extrinsic parameters in the bundle adjustment (BA) process to achieve precise trajectory estimation.Furthermore, we conduct an extensive analysis of the role of bundle adjustment (BA) in the context of multi-camera scenarios, delving into its impact on tracking, local mapping, and global optimization. Our experimental evaluation entails comprehensive comparisons between ground truth data and the state-of-the-art SLAM system. To rigorously assess the system's performance, we utilize the EuRoC datasets. The consistent results of our evaluations demonstrate the superior accuracy of our system in comparison to existing approaches.

Abstract (translated)

多摄像头SLAM系统具有诸多优势,主要源于其能够整合更广泛视角下的信息,从而实现更高的稳健性和更准确的局部定位精度。在本文中,我们提出了对最先进的立体SLAM系统ORB-SLAM2的重大拓展和改进,旨在实现更高的性能,我们在开始时将来自所有摄像头的测量结果映射到一个虚拟相机,称之为BundledFrame。这个虚拟相机精心设计,以无缝适应多种摄像机配置,促进多个摄像头捕获到的数据的有效融合。此外,我们还利用 bundle adjustment(BA)过程中提取的外部参数来实现精确的运动轨迹估计。 此外,我们对多摄像头场景中 bundle adjustment(BA)的作用进行了深入分析,探讨了其对跟踪、局部建模和全局优化的影响。我们的实验评估是对真实数据和最先进的SLAM系统之间的全面比较。为了严谨评估系统的性能,我们使用了EuRoC数据集。评估结果表明,我们的系统的性能优于现有方法。

URL

https://arxiv.org/abs/2403.19886

PDF

https://arxiv.org/pdf/2403.19886.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot