Paper Reading AI Learner

Adaptive Multi-Scale Correlation Meta-Network for Few-Shot Remote Sensing Image Classification

2026-01-18 08:21:51
Anurag Kaushish, Ayan Sar, Sampurna Roy, Sudeshna Chakraborty, Prashant Trivedi, Tanupriya Choudhury, Kanav Gupta

Abstract

Few-shot learning in remote sensing remains challenging due to three factors: the scarcity of labeled data, substantial domain shifts, and the multi-scale nature of geospatial objects. To address these issues, we introduce Adaptive Multi-Scale Correlation Meta-Network (AMC-MetaNet), a lightweight yet powerful framework with three key innovations: (i) correlation-guided feature pyramids for capturing scale-invariant patterns, (ii) an adaptive channel correlation module (ACCM) for learning dynamic cross-scale relationships, and (iii) correlation-guided meta-learning that leverages correlation patterns instead of conventional prototype averaging. Unlike prior approaches that rely on heavy pre-trained models or transformers, AMC-MetaNet is trained from scratch with only $\sim600K$ parameters, offering $20\times$ fewer parameters than ResNet-18 while maintaining high efficiency ($<50$ms per image inference). AMC-MetaNet achieves up to 86.65\% accuracy in 5-way 5-shot classification on various remote sensing datasets, including EuroSAT, NWPU-RESISC45, UC Merced Land Use, and AID. Our results establish AMC-MetaNet as a computationally efficient, scale-aware framework for real-world few-shot remote sensing.

Abstract (translated)

在遥感领域中,少量样本学习(few-shot learning)仍面临三大挑战:标记数据的稀缺性、显著的域偏移以及地理空间对象的多尺度特性。为了解决这些问题,我们引入了自适应多尺度相关元网络(Adaptive Multi-Scale Correlation Meta-Network, AMC-MetaNet),这是一种轻量级但功能强大的框架,具有三项关键创新: 1. 相关引导特征金字塔:用于捕获尺度不变的模式。 2. 自适应通道相关模块(ACCM):用于学习动态跨尺度关系。 3. 基于相关性的元学习:利用相关性模式而非传统的原型平均。 与以往依赖重型预训练模型或变换器的方法不同,AMC-MetaNet从头开始训练,仅使用约60万个参数。这使得它比ResNet-18少20倍的参数量,同时保持了高效性(每张图像推理时间低于50毫秒)。在包括EuroSAT、NWPU-RESISC45、UC Merced Land Use和AID在内的多个遥感数据集上,AMC-MetaNet实现了高达86.65%的精度,在五类五样本分类任务中表现优异。我们的研究结果将AMC-MetaNet确立为一种计算效率高且尺度感知度强的框架,适用于现实世界的少量样本遥感应用。

URL

https://arxiv.org/abs/2601.12308

PDF

https://arxiv.org/pdf/2601.12308.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot