Paper Reading AI Learner

Towards general deep-learning-based tree instance segmentation models

2024-05-03 12:42:43
Jonathan Henrich, Jan van Delden

Abstract

The segmentation of individual trees from forest point clouds is a crucial task for downstream analyses such as carbon sequestration estimation. Recently, deep-learning-based methods have been proposed which show the potential of learning to segment trees. Since these methods are trained in a supervised way, the question arises how general models can be obtained that are applicable across a wide range of settings. So far, training has been mainly conducted with data from one specific laser scanning type and for specific types of forests. In this work, we train one segmentation model under various conditions, using seven diverse datasets found in literature, to gain insights into the generalization capabilities under domain-shift. Our results suggest that a generalization from coniferous dominated sparse point clouds to deciduous dominated high-resolution point clouds is possible. Conversely, qualitative evidence suggests that generalization from high-resolution to low-resolution point clouds is challenging. This emphasizes the need for forest point clouds with diverse data characteristics for model development. To enrich the available data basis, labeled trees from two previous works were propagated to the complete forest point cloud and are made publicly available at this https URL.

Abstract (translated)

从森林点云中提取单个树木是一个关键的任务,对于诸如碳储存估计等下游分析具有至关重要的意义。最近,基于深度学习的方法已经被提出,展示了从学习分割树木的潜力。由于这些方法以有监督的方式进行训练,因此问题是如何获得适用于各种设置的通用的模型。到目前为止,主要使用来自特定激光扫描类型的数据和特定类型的森林进行训练。在本文中,我们使用来自文献中七种不同数据集的一个分割模型,在各种条件下进行训练,以探究领域漂移下的泛化能力。我们的结果表明,从针叶林点云到落叶林点云的泛化是可能的。相反,定性证据表明,从高分辨率到低分辨率点云的泛化具有挑战性。这强调了需要具有多样数据特征的森林点云来支持模型开发。为了丰富现有的数据基础,来自两个以前工作的带有标签的树木被传播到完整的森林点云,并在此处公开发布,链接在此:https://www.xxx。

URL

https://arxiv.org/abs/2405.02061

PDF

https://arxiv.org/pdf/2405.02061.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot