Paper Reading AI Learner

An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution

2024-05-07 18:15:51
Naveed Sultan, Amir Hajian, Supavadee Aramvith

Abstract

In recent years, convolutional neural networks (CNNs) have achieved remarkable advancement in the field of remote sensing image super-resolution due to the complexity and variability of textures and structures in remote sensing images (RSIs), which often repeat in the same images but differ across others. Current deep learning-based super-resolution models focus less on high-frequency features, which leads to suboptimal performance in capturing contours, textures, and spatial information. State-of-the-art CNN-based methods now focus on the feature extraction of RSIs using attention mechanisms. However, these methods are still incapable of effectively identifying and utilizing key content attention signals in RSIs. To solve this problem, we proposed an advanced feature extraction module called Channel and Spatial Attention Feature Extraction (CSA-FE) for effectively extracting the features by using the channel and spatial attention incorporated with the standard vision transformer (ViT). The proposed method trained over the UCMerced dataset on scales 2, 3, and 4. The experimental results show that our proposed method helps the model focus on the specific channels and spatial locations containing high-frequency information so that the model can focus on relevant features and suppress irrelevant ones, which enhances the quality of super-resolved images. Our model achieved superior performance compared to various existing models.

Abstract (translated)

近年来,卷积神经网络(CNNs)在远程 sensing图像超分辨率领域取得了显著进展,这是由于远程 sensing图像(RSIs)中纹理和结构复杂性和变异性,这些图像在同一图像上重复,但在其他图像上不同。目前基于深度学习的超分辨率方法更加关注低频特征,这导致在捕捉轮廓、纹理和空间信息方面性能较低。最先进的基于CNN的超级分辨率方法现在专注于使用注意力机制提取RSIs的特征。然而,这些方法仍然无法有效地识别和利用RSIs中的关键内容关注信号。为了解决这个问题,我们提出了一个名为通道和空间注意力特征提取(CSA-FE)的高级特征提取模块,通过使用与标准视觉Transformer(ViT)集成的通道和空间注意力来有效地提取特征。在训练方面,我们使用UC Merced数据集在规模2、3和4上进行训练。实验结果表明,与各种现有模型相比,我们提出的方法有助于模型集中于包含高频信息的特定通道和空间位置,使模型可以关注相关特征并抑制无关特征,从而提高超分辨率图像的质量。我们的模型在各种现有模型中具有卓越的性能。

URL

https://arxiv.org/abs/2405.04595

PDF

https://arxiv.org/pdf/2405.04595.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot