Single Image Super-Resolution Based on Global-Local Information Synergy

2024-05-02 08:29:05
Nianzu Qiao, Lamei Di, Changyin Sun

Abstract

Although several image super-resolution solutions exist, they still face many challenges. CNN-based algorithms have lower computational complexity, but their accuracy still needs improvement. Transformer-based algorithms achieve higher accuracy, but their very high computational complexity makes them hard to adopt in practical applications. To address these challenges, this paper proposes a novel super-resolution reconstruction algorithm that significantly increases accuracy while keeping complexity low. The core of the algorithm is its Global-Local Information Extraction Module and Basic Block Module. By combining global and local information, the Global-Local Information Extraction Module aims to understand the image content more comprehensively, so that both the global structure and the local details of the image can be recovered more accurately; this provides rich information for the subsequent reconstruction process. Experimental results show that the proposed algorithm achieves the best overall performance, offering an efficient and practical new solution for super-resolution reconstruction.

URL

https://arxiv.org/abs/2405.01085

PDF

https://arxiv.org/pdf/2405.01085.pdf

