Paper Reading AI Learner

3D Object Detection in LiDAR Point Clouds using Graph Neural Networks

2023-01-29 19:23:01
Shreelakshmi C R, Surya S. Durbha, Gaganpreet Singh

Abstract

LiDAR (Light Detection and Ranging) is an advanced active remote sensing technique working on the principle of time of travel (ToT) for capturing highly accurate 3D information of the surroundings. LiDAR has gained wide attention in research and development with the LiDAR industry expected to reach 2.8 billion $ by 2025. Although the LiDAR dataset is of rich density and high spatial resolution, it is challenging to process LiDAR data due to its inherent 3D geometry and massive volume. But such a high-resolution dataset possesses immense potential in many applications and has great potential in 3D object detection and recognition. In this research we propose Graph Neural Network (GNN) based framework to learn and identify the objects in the 3D LiDAR point clouds. GNNs are class of deep learning which learns the patterns and objects based on the principle of graph learning which have shown success in various 3D computer vision tasks.

Abstract (translated)

激光雷达(LiDAR)是一种先进的主动遥感技术,基于时间旅行原理,用于捕捉周围环境的高精度三维信息。在研究和发展中,激光雷达受到了广泛关注,预计到2025年,激光雷达产业将达到28亿美元。尽管激光雷达数据集具有丰富的密度和高空间分辨率,但由于其固有的三维几何和巨大的体积,处理激光雷达数据具有挑战性。但是,这样高的分辨率数据在许多应用中具有巨大的潜力,并且在3D物体检测和识别方面具有巨大的潜力。在本研究中,我们提出了基于图神经网络(GNN)的框架,以学习和理解3D激光雷达点云中的物体。GNN是一种深度学习类别,基于图学习原则,通过学习模式和物体,在多种3D计算机视觉任务中取得了成功。

URL

https://arxiv.org/abs/2301.12519

PDF

https://arxiv.org/pdf/2301.12519.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot