POPL-KF: A Pose-Only Geometric Representation-Based Kalman Filter for Point-Line-Based Visual-Inertial Odometry

2026-02-06 06:45:39
Aiping Wang, Zhaolong Yang, Shuwen Chen, Hai Zhang

Abstract

Mainstream visual-inertial odometry (VIO) systems rely on point features for motion estimation and localization. However, their performance degrades in challenging scenarios. Moreover, the localization accuracy of multi-state constraint Kalman filter (MSCKF)-based VIO systems suffers from linearization errors associated with 3D feature coordinates and from delayed measurement updates. To improve the performance of VIO in challenging scenes, we first propose a pose-only geometric representation for line features. Building on this, we develop POPL-KF, a Kalman filter-based VIO system that employs a pose-only geometric representation for both point and line features. POPL-KF mitigates linearization errors by explicitly eliminating both point and line feature coordinates from the measurement equations, while enabling immediate updates from visual measurements. We also design a unified base-frame selection algorithm for both point and line features to ensure optimal constraints on camera poses within the pose-only measurement model. To further improve line feature quality, we propose a line feature filter based on image grid segmentation and bidirectional optical flow consistency. Our system is evaluated on public datasets and in real-world experiments, demonstrating that POPL-KF outperforms state-of-the-art (SOTA) filter-based methods (OpenVINS, PO-KF) and optimization-based methods (PL-VINS, EPLF-VINS) while maintaining real-time performance.
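
The abstract names the line feature filter but gives no details. As a rough illustration only, the sketch below shows one way a grid-plus-bidirectional-consistency filter could work. It is not the authors' implementation. The function names, the 1-pixel threshold, the 4x4 grid, and the "keep the longest line per cell" rule are all assumptions made for this example, and the backward-tracked endpoints are assumed to come from an optical flow tracker run forward and then backward.

```python
from math import hypot


def forward_backward_ok(start, back_tracked, max_err=1.0):
    """Return True if a point tracked forward and then backward lands
    within max_err pixels of where it started (assumed threshold)."""
    return hypot(start[0] - back_tracked[0], start[1] - back_tracked[1]) <= max_err


def filter_lines(lines, back_tracked, img_w, img_h, grid=4, max_err=1.0):
    """Hypothetical line filter.

    lines:        list of ((x1, y1), (x2, y2)) segments in the previous image.
    back_tracked: same layout, endpoints after forward-then-backward tracking.
    Returns the sorted indices of kept lines: both endpoints must pass the
    consistency check, and each grid cell (chosen by the segment midpoint)
    keeps only its longest surviving line.
    """
    best = {}  # grid cell -> (length, line index)
    for i, (seg, back) in enumerate(zip(lines, back_tracked)):
        # Reject the line if either endpoint drifts after the round trip.
        if not all(forward_backward_ok(p, q, max_err) for p, q in zip(seg, back)):
            continue
        (x1, y1), (x2, y2) = seg
        mx, my = (x1 + x2) / 2, (y1 + y2) / 2
        # Map the midpoint to a grid cell, clamping points on the far border.
        cell = (min(int(mx * grid / img_w), grid - 1),
                min(int(my * grid / img_h), grid - 1))
        length = hypot(x2 - x1, y2 - y1)
        if cell not in best or length > best[cell][0]:
            best[cell] = (length, i)
    return sorted(i for _, i in best.values())
```

In this sketch, the grid step spreads the retained lines across the image, and the round-trip check discards lines whose endpoints were tracked unreliably. A real system would likely also compare line orientation and length between frames.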

URL

https://arxiv.org/abs/2602.06425

PDF

https://arxiv.org/pdf/2602.06425.pdf

