Paper Reading AI Learner

PureForest: A Large-scale Aerial Lidar and Aerial Imagery Dataset for Tree Species Classification in Monospecific Forests

2024-04-18 10:23:10
Charles Gaydon, Floryne Roche

Abstract

Knowledge of tree species distribution is fundamental to managing forests. New deep learning approaches promise significant accuracy gains for forest mapping, and are becoming a critical tool for mapping multiple tree species at scale. To advance the field, deep learning researchers need large benchmark datasets with high-quality annotations. To this end, we present the PureForest dataset: a large-scale, open, multimodal dataset designed for tree species classification from both Aerial Lidar Scanning (ALS) point clouds and Very High Resolution (VHR) aerial images. Most current public Lidar datasets for tree species classification have low diversity as they only span a small area of a few dozen annotated hectares at most. In contrast, PureForest has 18 tree species grouped into 13 semantic classes, and spans 339 km$^2$ across 449 distinct monospecific forests, and is to date the largest and most comprehensive Lidar dataset for the identification of tree species. By making PureForest publicly available, we hope to provide a challenging benchmark dataset to support the development of deep learning approaches for tree species identification from Lidar and/or aerial imagery. In this data paper, we describe the annotation workflow, the dataset, the recommended evaluation methodology, and establish a baseline performance from both 3D and 2D modalities.

Abstract (translated)

树木物种分布的了解是管理森林的基础。新的深度学习方法预计将在森林制图方面显著提高准确性,并成为放大规模绘制多种树种的 critical 工具。为了推动该领域的发展,深度学习研究人员需要大型高质量标注的数据集。为此,我们提出了 PureForest 数据集:一个大规模、开放、多模态的数据集,旨在从空域激光扫描(ALS)点云和非常高分辨率(VHR)航空图像中对树木物种进行分类。目前,大多数公开的 Lidar 数据集用于树木物种分类时具有较低的多样性,因为它们只覆盖了少数 annotated 公顷的土地。相比之下,PureForest 把 18 种树木分成了 13 个语义类,跨越了 449 个不同的单一森林,迄今为止是最大的、最全面的 Lidar 数据集,用于识别树木物种。通过将 PureForest 公开发布,我们希望为开发从 Lidar 和/或航空影像中识别树木物种的深度学习方法提供一个具有挑战性的基准数据集。在本文的数据论文中,我们描述了标注工作流程、数据集、推荐的评估方法和从 3D 和 2D 模态中建立基准性能。

URL

https://arxiv.org/abs/2404.12064

PDF

https://arxiv.org/pdf/2404.12064.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot